AN IMPROVED CLEANING COMPOSITION



Field of the Invention

The present invention relates to novel alpha-amylase mutants having
an amino acid sequence not found in nature, such mutants having an
amino acid sequence wherein one or more amino acid residue(s) of a
precursor alpha-amylase, specifically any oxidizable amino acid,
have been substituted with a different amino acid. The mutant 
enzymes of the present invention exhibit altered stability/activity 
profiles including but not limited to altered oxidative stability, 
altered pH performance profile, altered specific activity and/or 
altered thermostability. 

Background of the Invention

Alpha-amylases (alpha-1,4-glucan-4-glucanohydrolase, EC 3.2.1.1)
hydrolyze internal alpha-1,4-glucosidic linkages in starch largely
at random, to produce smaller molecular weight malto-dextrins. 
Alpha-amylases are of considerable commercial value, being used in 
the initial stages (liquefaction) of starch processing; in alcohol 
production; as cleaning agents in detergent matrices; and in the 
textile industry for starch desizing. Alpha-amylases are produced 
by a wide variety of microorganisms including Bacillus and 
Aspergillus, with most commercial amylases being produced from 
bacterial sources such as B. licheniformis, B. amyloliquefaciens, B.
subtilis, or B. stearothermophilus. In recent years the preferred
enzymes in commercial use have been those from B. licheniformis
because of their heat stability and performance, at least at neutral 
and mildly alkaline pH's. 

Previously there have been studies using recombinant DNA techniques
to explore which residues are important for the catalytic activity
of amylases and/or to explore the effect of modifying certain amino
acids within the active site of various amylases (Vihinen, M. et al.
(1990) J. Biochem. 107:267-272; Holm, L. et al. (1990) Protein
Engineering 3:181-191; Takase, K. et al. (1992) Biochimica et
Biophysica Acta 1120:281-288; Matsui, I. et al. (1992) FEBS Letters
Vol. 310, No. 3, pp. 216-218) and which residues are important for
thermal stability (Suzuki, Y. et al. (1989) J. Biol. Chem.
264:18933-18938). One group has used such methods to introduce
mutations at various histidine residues in a B. licheniformis
amylase. The rationale for making substitutions at histidine
residues was that B. licheniformis amylase (known to be
thermostable), when compared to other similar Bacillus amylases, has
an excess of histidines and, therefore, it was suggested that
replacing a histidine could affect the thermostability of the enzyme
(Declerck, N. et al. (1990) J. Biol. Chem. 265:15481-15488; FR 2 665
178-A1; Joyet, P. et al. (1992) Bio/Technology 10:1579-1583).

It has been found that alpha-amylase is inactivated by hydrogen 
peroxide and other oxidants at pH's between 4 and 10.5 as described 
in the examples herein. Commercially, alpha-amylase enzymes can be 
used under dramatically different conditions such as both high and 
low pH conditions, depending on the commercial application. For 
example, alpha-amylases may be used in the liquefaction of starch, a
process preferably performed at a low pH (pH <5.5). On the other 
hand, amylases may be used in commercial dish care or laundry 
detergents, which often contain oxidants such as bleach or peracids, 
and which are used in much more alkaline conditions. 

In order to alter the stability or activity profile of amylase
enzymes under varying conditions, it has been found that selective 
replacement, substitution or deletion of oxidizable amino acids, 
such as a methionine, tryptophan, tyrosine, histidine or cysteine, 
results in an altered profile of the variant enzyme as compared to 
its precursor. Because currently commercially available amylases 
are not acceptable (stable) under various conditions, there is a 
need for an amylase having an altered stability and/or activity 
profile. This altered stability (oxidative, thermal or pH 
performance profile) can be achieved while maintaining adequate 
enzymatic activity, as compared to the wild-type or precursor 
enzyme. The characteristic affected by introducing such mutations 
may be a change in oxidative stability while maintaining thermal 
stability or vice versa. Additionally, the substitution of
different amino acids for an oxidizable amino acid in the alpha-
amylase precursor sequence or the deletion of one or more oxidizable
amino acid(s) may result in altered enzymatic activity at a pH other
than that which is considered optimal for the precursor alpha-
amylase. In other words, the mutant enzymes of the present
invention may also have altered pH performance profiles, which may
be due to the enhanced oxidative stability of the enzyme.

Summary of the Invention

The present invention relates to novel alpha-amylase mutants that
are the expression product of a mutated DNA sequence encoding an
alpha-amylase, the mutated DNA sequence being derived from a
precursor alpha-amylase by the deletion or substitution
(replacement) of one or more oxidizable amino acids. In one
preferred embodiment of the present invention the mutants result from
substituting a different amino acid for one or more methionine
residue(s) in the precursor alpha-amylase. In another embodiment of
the present invention the mutants comprise a substitution of one or
more tryptophan residues alone or in combination with the
substitution of one or more methionine residues in the precursor
alpha-amylase. Such mutant alpha-amylases, in general, are obtained
by in vitro modification of a precursor DNA sequence encoding a 
naturally occurring or recombinant alpha-amylase to encode the 
substitution or deletion of one or more amino acid residues in a 
precursor amino acid sequence. 

Preferably the substitution or deletion of one or more amino acids in
the amino acid sequence is due to the replacement or deletion of one
or more methionine, tryptophan, cysteine, histidine or tyrosine
residues in such sequence; most preferably the residue which is
changed is a methionine residue. The oxidizable amino acid residues
may be replaced by any of the other 20 naturally occurring amino 
acids. If the desired effect is to alter the oxidative stability of 
the precursor, the amino acid residue may be substituted with a non- 
oxidizable amino acid (such as alanine, arginine, asparagine, 
aspartic acid, glutamic acid, glutamine, glycine, isoleucine,
leucine, lysine, phenylalanine, proline, serine, threonine, or 
valine) or another oxidizable amino acid (such as cysteine, 
methionine, tryptophan, tyrosine or histidine, listed in order of 
most easily oxidizable to less readily oxidizable). Likewise, if 
the desired effect is to alter thermostability, any of the other 20 
naturally occurring amino acids may be substituted (i.e., cysteine 
may be substituted for methionine).
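
The grouping of residues in the paragraph above can be written out as a small lookup. The following Python sketch is illustrative only: the function names and the demonstration peptide are hypothetical and are not taken from the figures; the residue lists and the oxidizability order are those stated in the text.

```python
# Oxidizable residues in one-letter code, ordered from most easily
# oxidizable to less readily oxidizable, as listed in the text.
OXIDIZABLE_ORDER = ["C", "M", "W", "Y", "H"]

# Non-oxidizable substitute residues named in the text (15 residues).
NON_OXIDIZABLE = set("ARNDEQGILKFPSTV")


def oxidizable_positions(sequence):
    """Return (position, residue) pairs, numbered from 1, for every
    oxidizable residue in an amino acid sequence."""
    return [
        (index, residue)
        for index, residue in enumerate(sequence, start=1)
        if residue in OXIDIZABLE_ORDER
    ]


def is_more_oxidizable(first, second):
    """True if `first` precedes `second` in the order given in the text."""
    return OXIDIZABLE_ORDER.index(first) < OXIDIZABLE_ORDER.index(second)


# Hypothetical peptide used only to exercise the functions.
demo_peptide = "AMKWTHC"
print(oxidizable_positions(demo_peptide))
```

Applied to a full mature sequence, such a scan would simply enumerate the candidate positions for substitution or deletion; it says nothing about which substitutions are beneficial.
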

Preferred mutants comprise the substitution of a methionine residue
equivalent to any of the methionine residues found in B.
licheniformis alpha-amylase (+8, +15, +197, +256, +304, +366 and
+438). Most preferably the methionine to be replaced is a
methionine at a position equivalent to position +197 or +15 in B.
licheniformis alpha-amylase. Preferred substitute amino acids to
replace the methionine at position +197 are alanine (A), isoleucine
(I), threonine (T) or cysteine (C). The preferred substitute amino
acids at position +15 are leucine (L), threonine (T), asparagine
(N), aspartate (D), serine (S), valine (V) and isoleucine (I),
although other substitute amino acids not specified above may be
useful. Two specifically preferred mutants of the present invention
are M197T and M15L.

Another embodiment of this invention relates to mutants comprising
the substitution of a tryptophan residue equivalent to any of the
tryptophan residues found in B. licheniformis alpha-amylase (see
Fig. 2). Preferably the tryptophan to be replaced is at a position
equivalent to +138 in B. licheniformis alpha-amylase. A mutation 
(substitution) at a tryptophan residue may be made alone or in 
combination with mutations at other oxidizable amino acid residues. 
Specifically, it may be advantageous to modify by substitution at 
least one tryptophan in combination with at least one methionine 
(for example, the double mutant +138/+197).

The alpha-amylase mutants of the present invention, in general,
exhibit altered oxidative stability in the presence of hydrogen
peroxide and other oxidants such as bleach or peracids, or, more
specifically, milder oxidants such as chloramine-T. Mutant enzymes
having enhanced oxidative stability will be useful in extending the
shelf life and bleach, perborate, percarbonate or peracid
compatibility of amylases used in cleaning products. Similarly,
reduced oxidative stability may be useful in industrial processes
that require the rapid and efficient quenching of enzymatic
activity. The mutant enzymes of the present invention may also 
demonstrate a broadened pH performance profile whereby mutants such 
as M15L show stability for low pH starch liquefaction and mutants 
such as M197T show stability at high pH cleaning product conditions. 
The mutants of the present invention may also have altered thermal 
stability whereby the mutant may have enhanced stability at either
high or low temperatures. It is understood that any change
(increase or decrease) in the mutant's enzymatic characteristic(s),
as compared to its precursor, may be beneficial depending on the
desired end use of the mutant alpha-amylase.

In addition to starch processing and cleaning applications, variant 
amylases of the present invention may be used in any application in 
which known amylases are used; for example, variant amylases can be
used in textile processing, food processing, etc. Specifically, it 
is contemplated that a variant enzyme such as M197C, which is easily 
inactivated by oxidation, would be useful in a process where it is 
desirable to completely remove amylase activity at the end of the 
process, for example, in frozen food processing applications. 

The preferred alpha-amylase mutants of the present invention are 
derived from a Bacillus strain such as B. licheniformis, B. 
amyloliquefaciens, and B. stearothermophilus, and most preferably
from Bacillus licheniformis. 

In another aspect of the present invention there is provided a novel 
form of the alpha-amylase normally produced by B. licheniformis.
This novel form, designated as the A4 form, has an additional four
alanine residues at the N-terminus of the secreted amylase. (Fig.
4b.) Derivatives or mutants of the A4 form of alpha-amylase are 
encompassed within the present invention. By derivatives or mutants 
of the A4 form, it is meant that the present invention comprises the 
A4 form alpha-amylase containing one or more additional mutations 
such as, for example, mutation (substitution, replacement or 
deletion) of one or more oxidizable amino acid(s). 

In a composition embodiment of the present invention there are 
provided detergent compositions, liquid, gel or granular, comprising
the alpha-amylase mutants described herein. Particularly preferred 
are detergent compositions comprising a +197 position mutant either 
alone or in combination with other enzymes such as endoglycosidases, 
cellulases, proteases, lipases or other amylase enzymes. 
Additionally, it is contemplated that the compositions of the 
present invention may include an alpha-amylase mutant having more 
than one site-specific mutation. 

In yet another composition embodiment of the present invention there
are provided compositions useful in starch processing and
particularly starch liquefaction. The starch liquefaction
compositions of the present invention preferably comprise an alpha-
amylase mutant having a substitution or deletion at position M15.
Additionally, it is contemplated that such compositions may comprise
additional components as known to those skilled in the art,
including, for example, antioxidants, calcium, ions, etc.

In a process aspect of the present invention there are provided 
methods for liquefying starch, and particularly granular starch 
slurries, from either a wet or dry milled process. Generally, in 
the first step of the starch degradation process, the starch slurry
is gelatinized by heating at a relatively high temperature (up to
about 110°C). After the starch slurry is gelatinized it is
liquefied and dextrinized using an alpha-amylase. The conditions
for such liquefaction are described in commonly assigned US patent
applications 07/785,624 and 07/785,623 and US Patent 5,180,669, the
disclosures of which are incorporated herein by reference. The
present method for liquefying starch comprises adding to a starch
slurry an effective amount of an alpha-amylase of the present
invention, alone or in combination with additional excipients such 
as an antioxidant, and reacting the slurry for an appropriate time 
and temperature to liquefy the starch. 

A further aspect of the present invention comprises the DNA encoding 
the mutant alpha-amylases of the present invention (including A4 
form and mutants thereof) and expression vectors encoding the DNA as 
well as host cells transformed with such expression vectors. 

Brief Description of the Drawings

Fig. 1 shows the DNA sequence of the gene for alpha-amylase from B.
licheniformis (NCIB8061), Seq ID No 31, and deduced translation
product as described in Gray, G. et al. (1986) J. Bacter. 166:635-
643.

Fig. 2 shows the amino acid sequence of the mature alpha-amylase 
enzyme from B. licheniformis (NCIB8061), Seq ID No 32.

Fig. 3 shows an alignment of primary structures of Bacillus alpha-
amylases. The B. licheniformis amylase (Am-Lich), Seq ID No 33, is
described by Gray, G. et al. (1986) J. Bact. 166:635-643; the B.
amyloliquefaciens amylase (Am-Amylo), Seq ID No 34, is described by






WO 96/05295 PCT/US95/10426 

Takkinen, K. et al. (1983) J. Biol. Chem. 258:1007-1013; and the B.
stearothermophilus (Am-Stearo), Seq ID No 35, is described by Ihara,
H. et al. (1985) J. Biochem. 98:95-103.

Fig. 4a shows the amino acid sequence of the mature alpha-amylase
variant M197T, Seq ID No 36. 

Fig. 4b shows the amino acid sequence of the A4 form of alpha-
amylase from B. licheniformis NCIB8061, Seq ID No 37. Numbering is
from the N-terminus, starting with the four additional alanines.

Fig. 5 shows plasmid pA4BL wherein BLAA refers to B. licheniformis
alpha-amylase gene, PstI to SstI; AmpR refers to the ampicillin-
resistant gene from pBR322; and CAT refers to the chloramphenicol-
resistant gene from pC194.

Fig. 6 shows the signal sequence-mature protein junctions for B.
licheniformis (Seq ID No 38), B. subtilis (Seq ID No 39), B.
licheniformis in pA4BL (Seq ID No 40) and B. licheniformis in pBLapr
(Seq ID No 41).

Fig. 7a shows inactivation of certain alpha-amylases (Spezyme® AA20
and M197L (A4 form)) with 0.88M H2O2 at pH 5.0, 25°C.

Fig. 7b shows inactivation of certain alpha-amylases (Spezyme® AA20,
M197T) with 0.88M H2O2 at pH 10.0, 25°C.

Fig. 7c shows inactivation of certain alpha-amylases (Spezyme® AA20,
M15L) with 0.88M H2O2 at pH 5.0, 25°C.

Fig. 8 shows a schematic for the production of M197X cassette 
mutants.

Fig. 9 shows expression of M197X variants. 

Fig. 10 shows thermal stability of M197X variants at pH 5.0, 5mM
CaCl2 at 95°C for 5 mins.

Figs. 11a and 11b show inactivation of certain amylases in automatic
dish care detergents. Fig. 11a shows the stability of certain
amylases in Cascade™ (a commercially available dish care product) at
65°C in the presence or absence of starch. Fig. 11b shows the
stability of certain amylases in Sunlight™ (a commercially available
dish care product) at 65°C in the presence or absence of starch.

Fig. 12 shows a schematic for the production of M15X cassette 
mutants. 

Fig. 13 shows expression of M15X variants. 

Fig. 14 shows specific activity of M15X variants on soluble starch. 

Fig. 15 shows heat stability of M15X variants at 90°C, pH 5.0, 5mM
CaCl2, 5 mins.

Fig. 16 shows specific activity on starch and soluble substrate, and 
performance in jet liquefaction at pH 5.5, of M15 variants as a 
function of percent activity of B. licheniformis wild-type.

Fig. 17 shows the inactivation of B. licheniformis alpha-amylase
(AA20 at 0.65 mg/ml) with chloramine-T at pH 8.0 as compared to 
variants M197A (1.7 mg/ml) and M197L (1.7 mg/ml). 

Fig. 18 shows the inactivation of B. licheniformis alpha-amylase 
(AA20 at 0.22 mg/ml) with chloramine-T at pH 4.0 as compared to
variants M197A (4.3 mg/ml) and M197L (0.53 mg/ml). 

Fig. 19 shows the reaction of B. licheniformis alpha-amylase (AA20
at 0.75 mg/ml) with chloramine-T at pH 5.0 as compared to double
variants M197T/W138F (0.64 mg/ml) and M197T/W138Y (0.60 mg/ml). 

Fig. 20 shows the stability testing results of various alpha-amylase 
multiple mutants incorporated in automatic dish detergent (ADD) 
formulations at temperatures from room temperature increased to 
65°C. 

Fig. 21 shows the stability of certain amylase mutants (compared to 
wild-type) in an automatic dish detergent at room temperature over 
0-30 days, as determined by percent activity remaining over time. 

Fig. 22 shows the stability of certain amylase mutants (compared to
wild-type) in an automatic dish detergent at 38°C (100°F) with 80%
relative humidity over 0-30 days. 

Detailed Description of the Invention

It is believed that amylases used in starch liquefaction may be 
subject to some form of inactivation due to some activity present in
the starch slurry (see commonly owned US applications 07/785,624 and
07/785,623 and US Patent 5,180,669, issued January 19, 1993,
incorporated herein by reference). Furthermore, use of an amylase
in the presence of oxidants, such as in bleach- or peracid- 
containing detergents, may result in partial or complete 
inactivation of the amylase. Therefore, the present invention 
focuses on altering the oxidative sensitivity of amylases. The 
mutant enzymes of the present invention may also have an altered pH 
profile and/or altered thermal stability which may be due to the 
enhanced oxidative stability of the enzyme at low or high pH's. 

Alpha-amylase as used herein includes naturally occurring amylases 
as well as recombinant amylases. Preferred amylases in the present 
invention are alpha-amylases derived from B. licheniformis or B. 
stearothermophilus, including the A4 form of alpha-amylase derived
from B. licheniformis as described herein, as well as fungal alpha-
amylases such as those derived from Aspergillus (i.e., A. oryzae and
A. niger).

Recombinant alpha-amylase refers to an alpha-amylase in which the
DNA sequence encoding the naturally occurring alpha-amylase is
modified to produce a mutant DNA sequence which encodes the
substitution, insertion or deletion of one or more amino acids in
the alpha-amylase sequence. Suitable modification methods are
disclosed herein, and also in commonly owned US Patents 4,760,025
and 5,185,258, the disclosures of which are incorporated herein by
reference.

Homologies have been found between almost all endo-amylases
sequenced to date, ranging from plants, mammals, and bacteria
(Nakajima, R.T. et al. (1986) Appl. Microbiol. Biotechnol. 23:355-
360; Rogers, J.C. (1985) Biochem. Biophys. Res. Commun. 128:470-
476). There are four areas of particularly high homology in certain
Bacillus amylases, as shown in Fig. 3, wherein the underlined
sections designate the areas of high homology. Further, sequence
alignments have been used to map the relationship between Bacillus
endo-amylases (Feng, D.F. and Doolittle, R.F. (1987) J. Molec. Evol.
35:351-360). The relative sequence homology between B.
stearothermophilus and B. licheniformis amylase is about 66%, as
determined by Holm, L. et al. (1990) Protein Engineering 3 (3) pp.
181-191. The sequence homology between B. licheniformis and B.
amyloliquefaciens amylases is about 81%, as per Holm, L. et al.,
supra. While sequence homology is important, it is generally
recognized that structural homology is also important in comparing
amylases or other enzymes. For example, structural homology between
fungal amylases and bacterial (Bacillus) amylase has been suggested
and, therefore, fungal amylases are encompassed within the present
invention.

An alpha-amylase mutant has an amino acid sequence which is derived 
from the amino acid sequence of a precursor alpha-amylase. The 
precursor alpha-amylases include naturally occurring alpha-amylases 
and recombinant alpha-amylases (as defined). The amino acid
sequence of the alpha-amylase mutant is derived from the precursor 
alpha-amylase amino acid sequence by the substitution, deletion or 
insertion of one or more amino acids of the precursor amino acid 
sequence. Such modification is of the precursor DNA sequence which 
encodes the amino acid sequence of the precursor alpha-amylase 
rather than manipulation of the precursor alpha-amylase enzyme per 
se. Suitable methods for such manipulation of the precursor DNA 
sequence include methods disclosed herein and in commonly owned US 
Patents 4,760,025 and 5,185,258.

Specific residues corresponding to positions M197, M15 and W138 of
Bacillus licheniformis alpha-amylase are identified herein for
substitution or deletion, as are all methionine, histidine,
tryptophan, cysteine and tyrosine positions. The amino acid
position number (i.e., +197) refers to the number assigned to the
mature Bacillus licheniformis alpha-amylase sequence presented in
Fig. 2. The invention, however, is not limited to the mutation of
this particular mature alpha-amylase (B. licheniformis) but extends
to precursor alpha-amylases containing amino acid residues at
positions which are equivalent to the particular identified residue
in B. licheniformis alpha-amylase. A residue (amino acid) of a
precursor alpha-amylase is equivalent to a residue of B.
licheniformis alpha-amylase if it is either homologous (i.e.,
corresponding in position in either primary or tertiary structure)
or analogous to a specific residue or portion of that residue in B.
licheniformis alpha-amylase (i.e., having the same or similar
functional capacity to combine, react, or interact chemically or
structurally).

In order to establish homology to primary structure, the amino acid 
sequence of a precursor alpha-amylase is directly compared to the B. 
licheniformis alpha-amylase primary sequence and particularly to a 
set of residues known to be invariant to all alpha-amylases for 
which sequence is known, as seen in Fig. 3. It is possible also to 
determine equivalent residues by tertiary structure: crystal 
structures have been reported for porcine pancreatic alpha-amylase 
(Buisson, G. et al. (1987) EMBO J. 6:3909-3916); Taka-amylase A from
Aspergillus oryzae (Matsuura, Y. et al. (1984) J. Biochem. (Tokyo) 
95:697-702); and an acid alpha-amylase from A. niger (Boel, E. et 
al. (1990) Biochemistry 29:6244-6249), with the former two 
structures being similar. There are no published structures for 
Bacillus alpha-amylases, although there are predicted to be common 
super-secondary structures between glucanases (MacGregor, E.A. & 
Svensson, B. (1989) Biochem. J. 259:145-152) and a structure for the 
B. stearothermophilus enzyme has been modeled on that of Taka- 
amylase A (Holm, L. et al. (1990) Protein Engineering 3:181-191). 
The four highly conserved regions shown in Fig. 3 contain many 
residues thought to be part of the active-site (Matsuura, Y. et al. 
(1984) J. Biochem. (Tokyo) 95:697-702; Buisson, G. et al. (1987) 
EMBO J. 6:3909-3916; Vihinen, M. et al. (1990) J. Biochem. 107:267- 
272) including, in the licheniformis numbering, His105; Arg229;
Asp231; His235; Glu261 and Asp328. 
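
The determination of an equivalent residue from a primary-structure alignment, as described above, can be sketched as a column lookup. The Python below is a minimal illustration under stated assumptions: the two sequences are supplied already aligned with "-" as the gap character, the function name is hypothetical, and the toy alignment is invented for demonstration rather than taken from Fig. 3.

```python
def equivalent_position(aligned_reference, aligned_precursor,
                        reference_position, gap="-"):
    """Map a 1-based residue number of the reference sequence onto the
    precursor's own numbering using a pairwise alignment.

    Returns (precursor_position, precursor_residue), or None when the
    precursor has a gap in the column holding the reference residue.
    """
    if len(aligned_reference) != len(aligned_precursor):
        raise ValueError("aligned sequences must have equal length")

    reference_count = 0
    precursor_count = 0
    for ref_char, pre_char in zip(aligned_reference, aligned_precursor):
        if ref_char != gap:
            reference_count += 1
        if pre_char != gap:
            precursor_count += 1
        if ref_char != gap and reference_count == reference_position:
            if pre_char == gap:
                return None
            return (precursor_count, pre_char)

    raise ValueError("reference_position lies beyond the reference sequence")


# Toy alignment (hypothetical, for illustration only).
toy_reference = "AC-MW"
toy_precursor = "ACGM-"
print(equivalent_position(toy_reference, toy_precursor, 3))
```

This covers only homology by primary structure; equivalence by tertiary structure or by analogous function, also contemplated in the text, would need structural data that a sequence alignment does not carry.
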

Expression vector as used herein refers to a DNA construct 
containing a DNA sequence which is operably linked to a suitable
control sequence capable of effecting the expression of said DNA in 
a suitable host. Such control sequences may include a promoter to 
effect transcription, an optional operator sequence to control such 
transcription, a sequence encoding suitable mRNA ribosome-binding
sites, and sequences which control termination of transcription and 
translation. A preferred promoter is the B. subtilis aprE promoter. 
The vector may be a plasmid, a phage particle, or simply a potential 
genomic insert. Once transformed into a suitable host, the vector
may replicate and function independently of the host genome, or may,
in some instances, integrate into the genome itself. In the present
specification, plasmid and vector are sometimes used interchangeably 
as the plasmid is the most commonly used form of vector at present. 
However, the invention is intended to include such other forms of 
expression vectors which serve equivalent functions and which are, 
or become, known in the art. 

Host strains (or cells) useful in the present invention generally 
are procaryotic or eucaryotic hosts and include any transformable 
microorganism in which the expression of alpha-amylase can be
achieved. Specifically, host strains of the same species or genus 
from which the alpha-amylase is derived are suitable, such as a 
Bacillus strain. Preferably an alpha-amylase negative Bacillus 
strain (genes deleted) and/or an alpha-amylase and protease deleted 
Bacillus strain such as Bacillus subtilis strain BG2473 
(ΔamyE, Δapr, Δnpr) is used. Host cells are transformed or
transfected with vectors constructed using recombinant DNA 
techniques. Such transformed host cells are capable of either 
replicating vectors encoding the alpha-amylase and its variants 
(mutants) or expressing the desired alpha-amylase. 

Preferably the mutants of the present invention are secreted into 
the culture medium during fermentation. Any suitable signal 
sequence, such as the aprE signal peptide, can be used to achieve 
secretion. 

Many of the alpha-amylase mutants of the present invention are 
useful in formulating various detergent compositions, particularly 
certain dish care cleaning compositions, especially those cleaning 
compositions containing known oxidants. Alpha-amylase mutants of 
the invention can be formulated into known powdered, liquid or gel 
detergents having a pH between 6.5 and 12.0. Suitable granular
compositions may be made as described in commonly owned US patent
applications 07/429,881, 07/533,721 and 07/957,973, all of which are
incorporated herein by reference. These detergent cleaning
compositions can also contain other enzymes, such as known
proteases, lipases, cellulases, endoglycosidases or other amylases,
as well as builders, stabilizers or other excipients known to those
skilled in the art. These enzymes can be present as co-granules or
as blended mixes or in any other manner known to those skilled in
the art. Furthermore, it is contemplated by the present invention
that multiple mutants may be useful in cleaning or other
applications. For example, a mutant enzyme having changes at both
+15 and +197 may exhibit enhanced performance useful in a cleaning
product or a multiple mutant comprising changes at +197 and +138 may
have improved performance. Specifically preferred mutant enzymes
for use in cleaning products, and particularly dish care
formulations, include but are not limited to M15T/M197T; M15S/M197T;
W138Y/M197T; M15S/W138Y/M197T; and M15T/W138Y/M197T.
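
The multiple-mutant designations listed above follow a regular pattern (original residue, position, replacement residue, with "/" separating substitutions), so they can be parsed mechanically. The following Python sketch is illustrative: the function names are hypothetical, and the four-residue toy sequence is invented so that the example stays self-contained.

```python
import re

# One substitution: original residue, 1-based position, replacement residue.
SUBSTITUTION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_mutant(name):
    """Split a designation such as 'M15T/W138Y/M197T' into a list of
    (original, position, replacement) tuples."""
    substitutions = []
    for token in name.split("/"):
        match = SUBSTITUTION.match(token.strip())
        if match is None:
            raise ValueError(f"unrecognized substitution: {token!r}")
        original, position, replacement = match.groups()
        substitutions.append((original, int(position), replacement))
    return substitutions


def apply_mutant(sequence, name):
    """Apply every substitution in `name` to `sequence`, checking that
    the precursor really carries the stated original residue."""
    residues = list(sequence)
    for original, position, replacement in parse_mutant(name):
        if not 1 <= position <= len(residues):
            raise ValueError(f"position {position} is outside the sequence")
        if residues[position - 1] != original:
            raise ValueError(
                f"expected {original} at {position}, "
                f"found {residues[position - 1]}"
            )
        residues[position - 1] = replacement
    return "".join(residues)


print(parse_mutant("M15T/W138Y/M197T"))
# Toy precursor (hypothetical), not the B. licheniformis sequence.
print(apply_mutant("AMGW", "M2T/W4Y"))
```

The check on the original residue is a deliberate design choice: it catches a designation applied to the wrong numbering scheme, for instance the A4 form, whose numbering starts with the four additional alanines.
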

Another embodiment of the present invention comprises the 
combination of the mutant alpha-amylase enzymes described herein in 
combination with other enzymes (i.e., proteases, lipases, 
cellulases, etc.), and preferably oxidatively stable proteases. 
Suitable oxidatively stable proteases include genetically engineered 
proteases such as those described in US Re 34606, incorporated 
herein by reference, as well as commercially available enzymes such 
as DURAZYM (Novo Nordisk), MAXAPEM (Gist-brocades) and PURAFECT OXP
(Genencor International, Inc.). Suitable methods for making such 
protease mutants (oxidatively stable proteases), and particularly 
such mutants having a substitution for the methionine at a position 
equivalent to M222 in B. amyloliquefaciens, are described in US Re
34606. Suitable methods for determining "equivalent" positions in
other subtilisins are provided in Re 34606, EP 257,446 and USSN 
212,291, which are incorporated herein by reference. 

As described previously, alpha-amylase mutants of the present 
invention may also be useful in the liquefaction of starch. Starch 
liquefaction, particularly granular starch slurry liquefaction, is 
typically carried out at near neutral pH's and high temperatures. 
As described in commonly owned US applications 07/785,624 and
07/785,623 and US Patent 5,180,669, it appears that an oxidizing 
agent or inactivating agent of some sort is also present in typical 
liquefaction processes, which may affect the enzyme activity; thus, 
in these related patent applications an antioxidant is added to the 
process to protect the enzyme. 

Based on the conditions of a preferred liquefaction process, as 
described in commonly owned US applications 07/785,624 and
07/785,623 and US Patent 5,180,669, namely low pH, high temperature
and potential oxidation conditions, preferred mutants of the present
invention for use in liquefaction processes comprise mutants
exhibiting altered pH performance profiles (i.e., low pH profile, pH
<6 and preferably pH <5.5), and/or altered thermal stability (i.e.,
high temperature, about 90°-110°C), and/or altered oxidative
stability (i.e., enhanced oxidative stability).

Thus, an improved method for liquefying starch is taught by the 
present invention, the method comprising liquefying a granular 
starch slurry from either a wet or dry milling process at a pH from 
about 4 to 6 by adding an effective amount of an alpha-amylase 
mutant of the present invention to the starch slurry; optionally 
adding an effective amount of an antioxidant or other excipient to 
the slurry; and reacting the slurry for an appropriate time and 
temperature to liquefy the starch. 

The following is presented by way of example and is not to be 
construed as a limitation to the scope of the claims. Abbreviations 
used herein, particularly three letter or one letter notations for 
amino acids are described in Dale, J.W., Molecular Genetics of
Bacteria, John Wiley & Sons, (1989) Appendix B. 



Example 1

Substitutions for the Methionine Residues in
B. licheniformis Alpha-Amylase

The alpha-amylase gene (Fig. 1) was cloned from B. licheniformis
NCIB8061 obtained from the National Collection of Industrial
Bacteria, Aberdeen, Scotland (Gray, G. et al. (1986) J. Bacteriology
166:635-643). The 1.72kb PstI-SstI fragment, encoding the last
three residues of the signal sequence, the entire mature protein and
the terminator region, was subcloned into M13MP16. A synthetic
terminator was added between the BclI and SstI sites using a
synthetic oligonucleotide cassette of the form:

BclI                                              SstI

5' GATCAAAACATAAAAAACCGGCCTTGG 3'

3' TTTTGTATTTTTTGGCCGGAACCGGGGCGGCCAAAAAATAATAAAAAC 5'

Seq ID No 1
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designed to contain the B. amyloliquefaciens subtilisin 
transcriptional terminator (Wells t al. (1983) Nucleic Acid 
Research 11:7911-7925). 

Site-directed mutagenesis by oligonucleotides used essentially the
protocol of Zoller, M. et al. (1983) Meth. Enzymol. 100:468-500:
briefly, 5'-phosphorylated oligonucleotide primers were used to
introduce the desired mutations on the M13 single-stranded DNA
template using the oligonucleotides listed in Table I to substitute
for each of the seven methionines found in B. licheniformis
alpha-amylase. Each mutagenic oligonucleotide also introduced a
restriction endonuclease site to use as a screen for the linked
mutation.

TABLE I

Mutagenic Oligonucleotides for the Substitution of the
Methionine Residues in B. licheniformis Alpha-Amylase

M8A
5'-T GGG ACG CTG GCG CAG TAC TTT GAA TGG T-3'              Seq ID No 2
ScaI+

M15L
5'-TG ATG CAG TAC TTT GAA TGG TAC CTG CCC AAT GA-3'        Seq ID No 3
ScaI+ KpnI+

M197L
5'-GAT TAT TTG TTG TAT GCC GAT ATC GAC TAT GAC CAT-3'      Seq ID No 4
EcoRV+

M256A
5'-CG GGG AAG GAG GCC TTT ACG GTA GCT-3'                   Seq ID No 5
StuI+

M304L
5'-GC GGC TAT GAC TTA AGG AAA TTG C-3'                     Seq ID No 6
AflII+

M366A
5'-C TAC GGG GAT GCA TAC GGG ACG A-3'                      Seq ID No 7
NsiI+

M366Y
5'-C TAC GGG GAT TAC TAC GGG ACC AAG GGA GAC TCC C-3'      Seq ID No 8
StyI+

M438A
5'-CC GGT GGG GCC AAG CGG GCC TAT GTT GGC CGG CAA A-3'     Seq ID No 9
SfiI+

Bold letters indicate base changes introduced by oligonucleotide.

Codon changes indicated in the form M8A, where methionine (M) at
position +8 has been changed to alanine (A).

Underlining indicates restriction endonuclease site introduced by
oligonucleotide.
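The screening idea behind Table I, that each mutagenic primer carries a diagnostic restriction site alongside the codon change, can be illustrated with a minimal sketch. The function name `has_site` and the small `SITES` table are illustrative assumptions, not part of the original disclosure; the recognition sequences are the standard ones for the named enzymes, and the primer shown is the M197L entry of Table I.

```python
# Standard recognition sequences for some of the enzymes named in Table I.
# All are palindromic, so scanning one strand is sufficient.
SITES = {
    "ScaI": "AGTACT",
    "KpnI": "GGTACC",
    "EcoRV": "GATATC",
    "AflII": "CTTAAG",
    "NsiI": "ATGCAT",
}

def has_site(oligo: str, enzyme: str) -> bool:
    """Return True if the oligo (spaces and case ignored) contains the site."""
    seq = oligo.replace(" ", "").upper()
    return SITES[enzyme] in seq

# M197L primer as listed in Table I; EcoRV is its diagnostic site.
m197l = "GAT TAT TTG TTG TAT GCC GAT ATC GAC TAT GAC CAT"

if __name__ == "__main__":
    print("EcoRV site present:", has_site(m197l, "EcoRV"))
    print("KpnI site present:", has_site(m197l, "KpnI"))
```

A clone that has taken up the primer would be expected to gain the diagnostic site, which is what restriction analysis of the RF DNA checks before sequencing.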

The heteroduplex was used to transfect E. coli mutL cells (Kramer et
al. (1984) Cell 38:879) and, after plaque-purification, clones were
analyzed by restriction analysis of the RF1's. Positives were
confirmed by dideoxy sequencing (Sanger et al. (1977) Proc. Natl.
Acad. Sci. U.S.A. 74:5463-5467) and the PstI-SstI fragments for each
subcloned into an E. coli vector, plasmid pA4BL.

Plasmid pA4BL

Following the methods described in US application 860,468 (Power et
al.), which is incorporated herein by reference, a silent PstI site
was introduced at codon +1 (the first amino acid following the
signal cleavage site) of the aprE gene from pS168-1 (Stahl, M.L. and
Ferrari, E. (1984) J. Bacter. 158:411-418). The aprE promoter and
signal peptide region was then cloned out of a pJH101 plasmid
(Ferrari, F.A. et al. (1983) J. Bacter. 154:1513-1515) as a
HindIII-PstI fragment and subcloned into the pUC18-derived plasmid
JM102 (Ferrari, E. and Hoch, J.A. (1989) Bacillus, ed. C.R. Harwood,
Plenum Pub., pp. 57-72). Addition of the PstI-SstI fragment from B.
licheniformis alpha-amylase gave pA4BL (Fig. 5) having the resulting
aprE signal peptide-amylase junction as shown in Fig. 6.

Transformation Into B. subtilis

pA4BL is a plasmid able to replicate in E. coli and integrate into
the B. subtilis chromosome. Plasmids containing different variants
were transformed into B. subtilis (Anagnostopoulos, C. and Spizizen,
J. (1961) J. Bacter. 81:741-746) and integrated into the chromosome
at the aprE locus by a Campbell-type mechanism (Young, M. (1984) J.
Gen. Microbiol. 130:1613-1621). The Bacillus subtilis strain BG2473
was a derivative of I168 which had been deleted for amylase (ΔamyE)
and two proteases (Δapr, Δnpr) (Stahl, M.L. and Ferrari, E., J.
Bacter. 158:411-418 and US Patent 5,264,366, incorporated herein by
reference). After transformation the sacU32(Hy) (Henner, D.J. et
al. (1988) J. Bacter. 170:296-300) mutation was introduced by PBS-2
mediated transduction (Hoch, J.A. (1983) J. Bacter. 154:1513-1515).

N-terminal analysis of the amylase expressed from pA4BL in B.
subtilis showed it to be processed having four extra alanines at the
N-terminus of the secreted amylase protein ("A4 form"). These extra
residues had no significant, deleterious effect on the activity or
thermal stability of the A4 form and in some applications may
enhance performance. In subsequent experiments the correctly
processed forms of the licheniformis amylase and the variant M197T
were made from a very similar construction (see Fig. 6).
Specifically, the 5' end of the A4 construction was subcloned on an
EcoRI-SstII fragment, from pA4BL (Fig. 5) into M13BM20 (Boehringer
Mannheim) in order to obtain a coding-strand template for the
mutagenic oligonucleotide below:

5'-CAT GAG CGT CCC ATT AAG ATT TGC AGC CTG CGC AGA CAT GTT GCT-3'

Seq ID No 10

This primer eliminated the codons for the extra four N-terminal
alanines, correct forms being screened for by the absence of the
PstI site. Subcloning the EcoRI-SstII fragment back into the pA4BL
vector (Fig. 5) gave plasmid pBLapr. The M197T substitution could
then be moved, on a SstII-SstI fragment, out of pA4BL (M197T) into
the complementary pBLapr vector to give plasmid pBLapr (M197T).
N-terminal analysis of the amylase expressed from pBLapr in B.
subtilis showed it to be processed with the same N-terminus found in
B. licheniformis alpha-amylase.

Example 2 

Oxidative Sensitivity of Methionine Variants
B. licheniformis alpha-amylase, such as Spezyme® AA20 (commercially
available from Genencor International, Inc.), is inactivated rapidly
in the presence of hydrogen peroxide (Fig. 7). Various methionine
variants were expressed in shake-flask cultures of B. subtilis and
the crude supernatants purified by ammonium sulphate cuts. The
amylase was precipitated from a 20% saturated ammonium sulphate
supernatant by raising the ammonium sulphate to 70% saturated, and
then resuspended. The variants were then exposed to 0.88M hydrogen
peroxide at pH 5.0, at 25°C. Variants at six of the methionine
positions in B. licheniformis alpha-amylase were still subject to
oxidation by peroxide while the substitution at position +197
(M197L) showed resistance to peroxide oxidation. (See Fig. 7.)
However, subsequent analysis described in further detail below
showed that while a variant may be susceptible to oxidation at pH
5.0, 25°C, it may exhibit altered/enhanced properties under
different conditions (i.e., liquefaction).

Example 3 

Construction of All Possible Variants at Position 197
All of the M197 variants (M197X) were produced in the A4 form by
cassette mutagenesis, as outlined in Fig. 8:

1) Site directed mutagenesis (via primer extension in M13)
was used to make M197A using the mutagenic oligonucleotide
below:

M197A

5'-GAT TAT TTG GCG TAT GCC GAT ATC GAC TAT GAC CAT-3'

EcoRV+

ClaI-                                                  Seq ID No 11

which also inserted an EcoRV site (codons 200-201) to replace
the ClaI site (codons 201-202).

2) Then primer LAAM12 (Table II) was used to introduce
another silent restriction site (BstBI) over codons 186-188.

3) The resultant M197A (BstBI+, EcoRV+) variant was then
subcloned (PstI-SstI fragment) into plasmid pA4BL and the
resultant plasmid digested with BstBI and EcoRV and the large
vector-containing fragment isolated by electroelution from
agarose gel.

4) Synthetic primers LAAM14-30 (Table II) were each annealed
with the largely complementary common primer LAAM13 (Table
II). The resulting cassettes encoded for all the remaining
naturally occurring amino acids at position +197 and were
ligated, individually, into the vector fragment prepared
above.

The cassettes were designed to destroy the EcoRV site upon ligation,
thus plasmids from E. coli transformants were screened for loss of
this unique site. In addition, the common bottom strand of the
cassette contained a frame-shift and encoded a NsiI site, thus
transformants derived from this strand could be eliminated by
screening for the presence of the unique NsiI site and would not be
expected, in any case, to lead to expression of active amylase.

Positives by restriction analysis were confirmed by sequencing and
transformed into B. subtilis for expression in shake-flask cultures
(Fig. 9). The specific activity of certain of the M197X mutants was
then determined using a soluble substrate assay. The data generated
using the following assay methods are presented below in Table III.

Soluble Substrate Assay: A rate assay was developed based on an
end-point assay kit supplied by Megazyme (Aust.) Pty. Ltd.: Each
vial of substrate (p-nitrophenyl maltoheptaoside, BPNPG7) was
dissolved in 10ml of sterile water, followed by a 1 to 4 dilution in
assay buffer (50mM maleate buffer, pH 6.7, 5mM calcium chloride,
0.002% Tween20). Assays were performed by adding 10μl of amylase to
790μl of the substrate in a cuvette at 25°C. Rates of hydrolysis
were measured as the rate of change of absorbance at 410nm, after a
delay of 75 seconds. The assay was linear up to rates of 0.4
absorption units/min.
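As an illustration of how a rate might be extracted from such an assay, the following sketch fits a least-squares line to absorbance readings taken at or after the 75 second delay and flags rates above the stated linear limit of 0.4 absorbance units/min. The function name, argument names and the sample readings are hypothetical; the source does not specify how the slope was computed.

```python
def hydrolysis_rate(times_s, absorbances, delay_s=75.0, max_linear_rate=0.4):
    """Return (rate in A410 units per minute, within_linear_range).

    Only readings taken at or after `delay_s` seconds are used, and the
    rate is the least-squares slope of absorbance against time in minutes.
    """
    pts = [(t / 60.0, a) for t, a in zip(times_s, absorbances) if t >= delay_s]
    if len(pts) < 2:
        raise ValueError("need at least two readings after the delay")
    n = len(pts)
    mean_t = sum(t for t, _ in pts) / n
    mean_a = sum(a for _, a in pts) / n
    sxx = sum((t - mean_t) ** 2 for t, _ in pts)
    if sxx == 0:
        raise ValueError("readings must be taken at distinct times")
    sxy = sum((t - mean_t) * (a - mean_a) for t, a in pts)
    rate = sxy / sxx
    return rate, rate <= max_linear_rate

if __name__ == "__main__":
    # Hypothetical readings: time in seconds, absorbance at 410 nm.
    rate, ok = hydrolysis_rate([0, 75, 135, 195], [0.05, 0.10, 0.30, 0.50])
    print(f"rate = {rate:.3f} A410/min, within linear range: {ok}")
```

A sample whose rate exceeds the linear limit would normally be diluted and re-assayed rather than reported.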

The amylase protein concentration was measured using the standard
Bio-Rad assay (Bio-Rad Laboratories), based on the method of
Bradford, M. (1976) Anal. Biochem. 72:248, using bovine serum
albumin standards.
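Specific activity combines the two measurements just described: an activity (here the soluble substrate rate) divided by the protein concentration, then expressed relative to a reference enzyme. The sketch below shows that arithmetic under the assumption that both enzymes are assayed at the same dilution; the function names and the numbers in the usage lines are hypothetical and are not data from this disclosure.

```python
def specific_activity(rate_per_min, protein_mg_per_ml):
    """Activity per unit protein (rate units per mg/ml of protein)."""
    if protein_mg_per_ml <= 0:
        raise ValueError("protein concentration must be positive")
    return rate_per_min / protein_mg_per_ml

def percent_of_reference(sample_sa, reference_sa):
    """Express a specific activity as a percentage of a reference value."""
    if reference_sa <= 0:
        raise ValueError("reference specific activity must be positive")
    return 100.0 * sample_sa / reference_sa

if __name__ == "__main__":
    # Hypothetical values for a reference enzyme and a variant.
    ref = specific_activity(0.20, 0.50)
    var = specific_activity(0.15, 0.50)
    print(f"variant = {percent_of_reference(var, ref):.0f}% of reference")
```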

Starch Hydrolysis Assay: The standard method for assaying the
alpha-amylase activity of Spezyme® AA20 was used. This method is
described in detail in Example 1 of USSN 07/765,624, incorporated
herein by reference. Native starch forms a blue color with iodine
but fails to do so when it is hydrolyzed into shorter dextrin
molecules. The substrate is soluble Lintner starch 5gm/liter in
phosphate buffer, pH 6.2 (42.5gm/liter potassium dihydrogen
phosphate, 3.16gm/liter sodium hydroxide). The sample is added in
25mM calcium chloride and activity is measured as the time taken to
give a negative iodine test upon incubation at 30°C. Activity is
recorded in liquefons per gram or ml (LU) calculated according to
the formula:



LU/ml or LU/g = (570 / (V x t)) x D

Where LU = liquefon unit
      V = volume of sample (5ml)
      t = dextrinization time (minutes)
      D = dilution factor = dilution volume/ml or g of added enzyme.
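The liquefon calculation can be written as a short function. This is a sketch that reads the constant in the formula as 570 and takes V, t and D as defined above; the function and parameter names are illustrative, and the numbers in the usage line are hypothetical.

```python
def liquefon_units(dextrinization_min, dilution_factor, sample_volume_ml=5.0):
    """LU per ml or per g: 570 / (V x t) x D.

    V is the sample volume in ml (5 ml in the method described),
    t the dextrinization time in minutes, D the dilution factor.
    """
    if dextrinization_min <= 0 or sample_volume_ml <= 0:
        raise ValueError("time and volume must be positive")
    return 570.0 / (sample_volume_ml * dextrinization_min) * dilution_factor

if __name__ == "__main__":
    # Hypothetical run: 10 minute dextrinization time, 100-fold dilution.
    print(liquefon_units(10.0, 100.0))
```

Because activity is inversely proportional to dextrinization time, halving the time to a negative iodine test doubles the reported LU at a fixed dilution.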



TABLE III

SPECIFIC ACTIVITY (as % of AA20 value) on:

ALPHA-AMYLASE        Soluble Substrate    Starch

Spezyme® AA20              100             100
A4 form                    105             115
M15L (A4 form)              93              94
M15L                        85             103
M197T (A4 form)             75              83
M197T                       62              81
M197A (A4 form)             88              89
M197C                       85              85
M197L (A4 form)             51              17



Example 4
Characterization of Variant M15L

Variant M15L made as per the prior examples did not show increased
amylase activity (Table III) and was still inactivated by hydrogen
peroxide (Fig. 7). It did, however, show significantly increased
performance in jet-liquefaction of starch, especially at low pH as
shown in Table IV below.

Starch liquefaction was typically performed using a Hydroheater M
103-M steam jet equipped with a 2.5 liter delay coil behind the
mixing chamber and a terminal back pressure valve. Starch was fed
to the jet by a Moyno pump and steam was supplied by a 150 psi steam
line, reduced to 90-100 psi. Temperature probes were installed just
after the Hydroheater jet and just before the back pressure valve.

Starch slurry was obtained from a corn wet miller and used within
two days. The starch was diluted to the desired solids level with
deionized water and the pH of the starch was adjusted with 2% NaOH
or saturated Na2CO3. Typical liquefaction conditions were:

Starch          32%-35% solids
Calcium         40-50 ppm (30 ppm added)
pH              5.0-6.0
Alpha-amylase   12-14 LU/g starch dry basis

Starch was introduced into the jet at about 350 ml/min. The jet
temperature was held at 105°-107°C. Samples of starch were
transferred from the jet cooker to a 95°C second stage liquefaction
and held for 90 minutes.

The degree of starch liquefaction was measured immediately after the
second stage liquefaction by determining the dextrose equivalence
(DE) of the sample and by testing for the presence of raw starch,
both according to the methods described in the Standard Analytical
Methods of the Member Companies of the Corn Refiners Association,
Inc., sixth edition. Starch, when treated generally under the
conditions given above and at pH 6.0, will yield a liquefied starch
with a DE of about 10 and with no raw starch. Results of starch
liquefaction tests using mutants of the present invention are
provided in Table IV.



TABLE IV

Performance of Variants M15L (A4 form) and M15L in Starch Liquefaction

                     pH      DE after 90 Mins.

Spezyme® AA20        5.9      9.9
M15L (A4 form)       5.9     10.4

Spezyme® AA20        5.2      1.2
M15L (A4 form)       5.2      2.2

Spezyme® AA20        5.9      9.3*
M15L                 5.9     11.3*

Spezyme® AA20        5.5      3.25**
M15L                 5.5      6.7**

Spezyme® AA20        5.2      0.7**
M15L                 5.2      3.65**

* average of three experiments
** average of two experiments

Example 5
Construction of M15X Variants
Following generally the processes described in Example 3 above, all
variants at M15 (M15X) were produced in native B. licheniformis by
cassette mutagenesis, as outlined in Fig. 12:

1) Site directed mutagenesis (via primer extension in M13) was
used to introduce unique restriction sites flanking the M15 codon to
facilitate insertion of a mutagenesis cassette. Specifically, a
BstBI site at codons 11-13 and a MscI site at codons 18-20 were
introduced using the two oligonucleotides shown below:

M15XBstBI   5'-G ATG CAG TAT TTC GAA CTGG TAT A-3'

BstBI                                              Seq ID No 48

M15XMscI    5'-TG CCC AAT GAT GGC CAA CAT TGG AAG-3'

MscI                                               Seq ID No 49

2) The vector for M15X cassette mutagenesis was then constructed
by subcloning the SfiI-SstII fragment from the mutagenized amylase
(BstBI+, MscI+) into plasmid pBLapr. The resulting plasmid was then
digested with BstBI and MscI and the large vector fragment isolated
by electroelution from a polyacrylamide gel.

3) Mutagenesis cassettes were created as with the M197X variants.
Synthetic oligomers, each encoding a substitution at codon 15, were
annealed to a common bottom primer. Upon proper ligation of the
cassette to the vector, the MscI site is destroyed, allowing for
screening of positive transformants by loss of this site. The bottom
primer contains a unique SnaBI site, allowing for the transformants
derived from the bottom strand to be eliminated by screening for the
SnaBI site. This primer also contains a frame shift which would also
eliminate amylase expression for the mutants derived from the common
bottom strand.

The synthetic cassettes are listed in Table V and the general
cassette mutagenesis strategy is illustrated in Figure 12.

TABLE V

Synthetic Oligonucleotides Used for Cassette Mutagenesis
to Produce M15X Variants

M15A  (BstBI) C GAA TGG TAT GCT CCC AAT GAC GG (MscI)   Seq ID No 50
M15R  (BstBI) C GAA TGG TAT CGC CCC AAT GAC GG (MscI)   Seq ID No 51
M15N  (BstBI) C GAA TGG TAT AAT CCC AAT GAC GG (MscI)   Seq ID No 52
M15D  (BstBI) C GAA TGG TAT GAT CCC AAT GAC GG (MscI)   Seq ID No 53
M15H  (BstBI) C GAA TGG TAT CAC CCC AAT GAC GG (MscI)   Seq ID No 54
M15K  (BstBI) C GAA TGG TAT AAA CCC AAT GAC GG (MscI)   Seq ID No 55
M15P  (BstBI) C GAA TGG TAT CCG CCC AAT GAC GG (MscI)   Seq ID No 56
M15S  (BstBI) C GAA TGG TAT TCT CCC AAT GAC GG (MscI)   Seq ID No 57
M15T  (BstBI) C GAA TGG TAC ACT CCC AAT GAC GG (MscI)   Seq ID No 58
M15V  (BstBI) C GAA TGG TAT GTT CCC AAT GAC GG (MscI)   Seq ID No 59
M15C  (BstBI) C GAA TGG TAT TGT CCC AAT GAC GG (MscI)   Seq ID No 60
M15Q  (BstBI) C GAA TGG TAT CAA CCC AAT GAC GG (MscI)   Seq ID No 61
M15E  (BstBI) C GAA TGG TAT GAA CCC AAT GAC GG (MscI)   Seq ID No 62
M15G  (BstBI) C GAA TGG TAT GGT CCC AAT GAC GG (MscI)   Seq ID No 63
M15I  (BstBI) C GAA TGG TAT ATT CCC AAT GAC GG (MscI)   Seq ID No 64
M15F  (BstBI) C GAA TGG TAT TTT CCC AAT GAC GG (MscI)   Seq ID No 65
M15W  (BstBI) C GAA TGG TAC TGG CCC AAT GAC GG (MscI)   Seq ID No 66
M15Y  (BstBI) C GAA TGG TAT TAT CCC AAT GAC GG (MscI)   Seq ID No 67

M15X  (MscI) CC GTC ATT GGG ACT ACG TAC CAT T (BstBI)   Seq ID No 68
(bottom strand)



Underline indicates codon changes at amino acid position 15. 

Conservative substitutions were made in some cases to prevent 
introduction of new restriction sites. 
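The cassettes in Table V share fixed flanking sequence and differ only at codon 15, which makes the design easy to sketch programmatically. The code below is an illustration only: the names `LEFT`, `RIGHT`, `top_strand` and `variant_name` are hypothetical, the codon table is restricted to a subset of the codons listed in Table V (translated with the standard genetic code), and the sketch ignores the conservative TAT to TAC change at the preceding codon used in the M15T and M15W cassettes.

```python
# Fixed flanks of the top-strand cassette, written 5' to 3' without spaces:
# "C GAA TGG TAT" + codon 15 + "CCC AAT GAC GG".
LEFT = "CGAATGGTAT"
RIGHT = "CCCAATGACGG"

# Subset of the codon-15 choices in Table V, mapped to one-letter amino
# acid codes using the standard genetic code.
CODON_TO_AA = {
    "CGC": "R", "AAT": "N", "GAT": "D", "CAC": "H", "AAA": "K",
    "CCG": "P", "TCT": "S", "ACT": "T", "GTT": "V", "TGT": "C",
    "CAA": "Q", "GAA": "E", "ATT": "I", "TTT": "F", "TGG": "W",
}

def top_strand(codon15: str) -> str:
    """Assemble a top-strand cassette carrying the given codon at position 15."""
    codon15 = codon15.upper()
    if len(codon15) != 3 or set(codon15) - set("ACGT"):
        raise ValueError("codon must be three bases from A, C, G, T")
    return LEFT + codon15 + RIGHT

def variant_name(codon15: str) -> str:
    """Name the variant in the M15X convention, e.g. 'M15N' for AAT."""
    return "M15" + CODON_TO_AA[codon15.upper()]

if __name__ == "__main__":
    for codon in ("CGC", "AAT", "GAA"):
        print(variant_name(codon), top_strand(codon))
```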

Example 6 

Bench Liquefaction with M15X Variants
Eleven alpha-amylase variants with substitutions for M15 made as per
Example 5 were assayed for activity, as compared to Spezyme® AA20
(commercially available from Genencor International, Inc.) in
liquefaction at pH 5.5 using a bench liquefaction system. The bench
scale liquefaction system consisted of a stainless steel coil (0.25
inch diameter, approximately 350 ml volume) equipped with a 7 inch
long static mixing element approximately 12 inches from the anterior
end and a 30 psi back pressure valve at the posterior end. The
coil, except for each end, was immersed in a glycerol-water bath
equipped with thermostatically controlled heating elements that
maintained the bath at 105-106°C.

Starch slurry containing enzyme, maintained in suspension by
stirring, was introduced into the reaction coil by a piston driven
metering pump at about 70 ml/min. The starch was recovered from the
end of the coil and was transferred to the secondary hold (95°C for
90 minutes). Immediately after the secondary hold, the DE of the
liquefied starch was determined, as described in Example 4. The
results are shown in Fig. 16.

Example 7 

Characterization of M197X Variants
As can be seen in Fig. 9, there was a wide range of amylase activity
(measured in the soluble substrate assay) expressed by the M197X (A4
form) variants. The amylases were partially purified from the
supernatants by precipitation with two volumes of ethanol and
resuspension. They were then screened for thermal stability (Fig.
10) by heating at 95°C for 5 minutes in 10mM acetate buffer pH 5.0,
in the presence of 5mM calcium chloride; the A4 wild-type retained
28% of its activity after incubation. For M197W and M197P we were
unable to recover active protein from the supernatants. Upon
sequencing, the M197H variant was found to contain a second
mutation, N190K. M197L was examined in a separate experiment and
was one of the least thermally stable variants. There appears to
be a broad correlation between expression of amylase activity and
thermal stability. The licheniformis amylase is restricted in what
residues it can accommodate at position 197 in terms of retaining or
enhancing thermal stability: cysteine and threonine are preferred
for maximal thermal stability under these conditions whereas alanine
and isoleucine are of intermediate stability. However, other
substitutions at position +197 result in lowered thermal stability
which may be useful for other applications. Additionally, different
substitutions at +197 may have other beneficial properties, such as
altered pH performance profile or altered oxidative stability. For
example, the M197C variant was found to inactivate readily by air
oxidation but had enhanced thermal stability. Conversely, compared
to the M197L variant, both M197T and M197A retained not only high
thermal stability (Fig. 10), but also high activity (Table III),
while maintaining resistance to inactivation by peroxide at pH 5 to
pH 10 (Fig. 7).

Example 8

Stability and Performance in Detergent Formulation
The stability of the M197T (A4 form), M197T and M197A (A4 form) was
measured in automatic dish care detergent (ADD) matrices. 2ppm
Savinase™ (a protease, commercially available from Novo Industries,
of the type commonly used in ADD) was added to two commercially
available bleach-containing ADD's: Cascade™ (Procter and Gamble,
Ltd.) and Sunlight™ (Unilever) and the time course of inactivation
of the amylase variants and Termamyl™ (a thermally stable
alpha-amylase available from Novo Nordisk, A/S) followed at 65°C. The
concentration of ADD product used in both cases was equivalent to
'pre-soak' conditions: 14gm product per liter of water (7 grams per
gallon hardness). As can be seen (Figs. 11a and 11b), both forms of
the M197T variant were much more stable than Termamyl™ and M197A (A4
form), which were inactivated before the first assay could be
performed. This stability benefit was seen in the presence or
absence of starch as determined by the following protocol. Amylases
were added to 5ml of ADD and Savinase™, prewarmed in a test tube
and, after vortexing, activities were assayed as a function of time,
using the soluble substrate assay. The "+ starch" tube had
spaghetti starch baked onto the sides (140°C, 60 mins.). The
results are shown in Figs. 11a and 11b.

Example 9

Characterization of M15X Variants
All M15X variants were propagated in Bacillus subtilis and the
expression level monitored as shown in Fig. 13. The amylase was
isolated and partially purified by a 20-70% ammonium sulfate cut.
The specific activity of these variants on the soluble substrate was
determined as per Example 3 (Fig. 14). Many of the M15X amylases
have specific activities greater than that of Spezyme® AA20. A
benchtop heat stability assay was performed on the variants by
heating the amylase at 90°C for 5 min. in 50 mM acetate buffer pH 5
in the presence of 5 mM CaCl2 (Fig. 15). Most of the variants
performed as well as Spezyme® AA20 in this assay. Those variants
that exhibited reasonable stability in this assay (reasonable
stability defined as those that retained at least about 60% of
Spezyme® AA20's heat stability) were tested for specific activity on
starch and for liquefaction performance at pH 5.5. The most
interesting of those mutants are shown in Fig. 16. M15D, N and T,
along with L, outperformed Spezyme® AA20 in liquefaction at pH 5.5
and have increased specific activities in both the soluble substrate
and starch hydrolysis assays.

Generally, we have found that by substituting for the methionine at
position 15, we can provide variants with increased low pH
liquefaction performance and/or increased specific activity.

Example 10

Tryptophan Sensitivity to Oxidation
Chloramine-T (sodium N-chloro-p-toluenesulfonimide) is a selective
oxidant, which oxidizes methionine to methionine sulfoxide at
neutral or alkaline pH. At acidic pH, chloramine-T will modify both
methionine and tryptophan (Schechter, Y., Burstein, Y. and
Patchornik, A. (1975) Biochemistry 14 (20) 4497-4503). Fig. 17
shows the inactivation of B. licheniformis alpha-amylase with
chloramine-T at pH 8.0 (AA20 = 0.65 mg/ml, M197A = 1.7 mg/ml, M197L
= 1.7 mg/ml). The data show that by changing the methionine at
position 197 to leucine or alanine, the inactivation of
alpha-amylase can be prevented. Conversely, as shown in Fig. 18, at pH
4.0 inactivation of the M197A and M197L proceeds, but requires more
equivalents of chloramine-T (Fig. 18; AA20 = 0.22 mg/ml, M197A = 4.3
mg/ml, M197L = 0.53 mg/ml; 200 mM NaAcetate at pH 4.0). This
suggests that a tryptophan residue is also implicated in the
chloramine-T mediated inactivation event. Furthermore, tryptic
mapping and subsequent amino acid sequencing indicated that the
tryptophan at position 138 was oxidized by chloramine-T (data not
shown). To prove this, site-directed mutants were made at
tryptophan 138 as provided below:

Preparation of Alpha-Amylase Double Mutants W138 and M197
Certain variants of W138 (F, Y and A) were made as double mutants,
with M197T (made as per the disclosure of Example 3). The double
mutants were made following the methods described in Examples 1 and
3. Generally, single negative strands of DNA were prepared from an
M13MP18 clone of the 1.72kb coding sequence (Pst I-Sst I) of the B.
licheniformis alpha-amylase M197T mutant. Site-directed
mutagenesis was done using the primers listed below, essentially by
the method of Zoller, M. et al. (1983) except T4 gene 32 protein and
T4 polymerase were substituted for Klenow. The primers all
contained unique sites, as well as the desired mutation, in order to
identify those clones with the appropriate mutation.

Tryptophan 138 to Phenylalanine

133 134 135 136 137 138 139 140 141 142 143
CAC CTA ATT AAA GCT TTC ACA CAT TTT CAT TTT      Seq ID No 42
             Hind III

Tryptophan 138 to Tyrosine

133 134 135 136 137 138 139 140 141 142 143
CAC CTA ATT AAA GCT TAC ACA CAT TTT CAT TTT      Seq ID No 43
             Hind III

Tryptophan 138 to Alanine - This primer also engineers unique sites
upstream and downstream of the 138 position.

  127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142
C CGC GTA ATT TCC GGA GAA CAC CTA ATT AAA GCC GCA ACA CAT TTT CAT
              BspE I

143 144 145 146 147
TTT CCC GGG CGC GGC AG                           Seq ID No 44
    Xma I

Mutants were identified by restriction analysis and W138F and W138Y
confirmed by DNA sequencing. The W138A sequence revealed a
nucleotide deletion between the unique BspE I and Xma I sites;
however, the rest of the gene sequenced correctly. The 1.37kb
SstII/SstI fragment containing both W138X and M197T mutations was
moved from M13MP18 into the expression vector pBLapr resulting in
pBLapr (W138F, M197T) and pBLapr (W138Y, M197T). The fragment
containing unique BspE I and Xma I sites was cloned into pBLapr
(BspE I, Xma I, M197T) since it is useful for cloning cassettes
containing other amino acid substitutions at position 138.



Single Mutations at Amino Acid Position 138

Following the general methods described in the prior examples,
certain single variants of W138 (F, Y, L, H and C) were made.

The 1.24kb Asp718-SstI fragment containing the M197T mutation in
plasmid pBLapr (W138X, M197T) of Example 7 was replaced by the
wild-type fragment with methionine at 197, resulting in pBLapr (W138F),
pBLapr (W138Y) and pBLapr (BspE I, Xma I).

The mutants W138L, W138H and W138C were made by ligating synthetic
cassettes into the pBLapr (BspE I, Xma I) vector using the following
primers:

Tryptophan 138 to Leucine

CC GGA GAA CAC CTA ATT AAA GCC CTA ACA CAT TTT CAT TTT C

Seq ID No 45

Tryptophan 138 to Histidine

CC GGA GAA CAC CTA ATT AAA GCC CAC ACA CAT TTT CAT TTT C

Seq ID No 46

Tryptophan 138 to Cysteine

CC GGA GAA CAC CTA ATT AAA GCC TGC ACA CAT TTT CAT TTT C

Seq ID No 47

Reaction of the double mutants M197T/W138F and M197T/W138Y with
chloramine-T was compared with wild-type (AA20 = 0.75 mg/ml,
M197T/W138F = 0.64 mg/ml, M197T/W138Y = 0.60 mg/ml; 50 mM NaAcetate
at pH 5.0). The results shown in Fig. 19 show that mutagenesis of
tryptophan 138 has caused the variant to be more resistant to
chloramine-T.

Example 11

Preparation of Multiple Mutants

Following the methods of Examples 1, 3, 5 and 10, the following
multiple mutants were made: M15T/M197T; M15S/M197T; W138Y/M197T;
M15S/W138Y/M197T and M15T/W138Y/M197T. Certain of these multiple
mutants were previously exemplified, for example, W138Y/M197T was
made and tested in Example 10. The multiple mutants were identified
by restriction analysis.

Various multiple mutants within the scope of the present invention
were further tested for performance as cleaning products (automatic
dish care detergent) additives. These tests are detailed below.

Stability Testing

A 4000 ppm solution of automatic dishwashing detergent (ADD)
containing perborate and TAED was prepared in water with a hardness
of 7 gpg. Certain amylase mutants described above were added to
this ADD solution to yield a rate of 0.4 when assayed by the
Ceralpha method (Megazyme (Austr.) Pty. Ltd., Parramatta, NSW,
Australia). One set of samples was held at room temperature
(21-23°C) for about 30 min. (non-heated). A second set of samples was
warmed from room temperature to about 65°C after addition of the
enzyme (heated). 30 min. after addition of the enzyme, the activity
of the amylase mutants was measured and the activity relative to the
activity at the time of addition of the enzyme was calculated
(relative activity %).

The results shown in Fig. 20 indicate that the methionine at 
position +197 of B. licheniformis alpha-amylase should be modified 
for stability in a formulation comprising ADD + perborate + TAED. 

Starch Hydrolysis Assay

A 4000 ppm solution of automatic dishwashing detergent (ADD) 
containing perborate and TAED was prepared in water with a hardness 
of 1 gpg and three cooked pieces of elbow macaroni were added. The 
amylase mutants described above were added to this ADD solution to 
yield a final concentration of 5 ppm active enzyme. The tubes were 
incubated at 50 °C for about 30 min. and the concentration of 
reducing sugars released was measured against a glucose standard 
curve using the dinitrosalicylic acid method. Results are shown in
Table VI.

Table VI

Enzyme               Reducing Sugar          Standard Deviation
                     Concentration (g/l)

No Enzyme            1.©4
Wild-Type            4.97                    0.30
M15S/M197T           5.40                    0.36
M15T/M197T           5.85                    0.38
W138Y/M197T          6.48                    0.36
M15S/W138Y/M197T     6.04                    0.74
M15T/W138Y/M197T     6.27                    0.49


The results shown in Table VI show that M15T/M197T; M15S/M197T;
W138Y/M197T; M15S/W138Y/M197T and M15T/W138Y/M197T performed well
compared to no enzyme and wild-type alpha-amylase controls.
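Reading reducing sugar concentrations off a glucose standard curve, as in the dinitrosalicylic acid method referred to above, amounts to fitting a line through the standards and inverting it for each sample. The sketch below assumes a linear response over the working range; the function names, standard concentrations and absorbance values are hypothetical and are not taken from this disclosure.

```python
def fit_line(xs, ys):
    """Least-squares slope and intercept for y = slope * x + intercept."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need at least two paired points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("standards must span more than one concentration")
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sxx
    return slope, mean_y - slope * mean_x

def reducing_sugar(absorbance, slope, intercept):
    """Invert the standard curve to get concentration from absorbance."""
    return (absorbance - intercept) / slope

if __name__ == "__main__":
    # Hypothetical glucose standards (g/l) and their absorbance readings.
    slope, intercept = fit_line([0.0, 2.0, 4.0, 8.0], [0.0, 0.2, 0.4, 0.8])
    print(f"sample = {reducing_sugar(0.5, slope, intercept):.2f} g/l glucose equivalents")
```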

Oatmeal Stains

Dishes were evenly soiled with a cooked, blended oatmeal paste and
dried overnight at 37°C. Dishes were loaded in an ASKO Model 770
dishwasher and washed at 45°C on the Quick Wash cycle using 10 g of
automatic dishwashing detergent containing 5% perborate, 3% TAED and
11 mg of certain amylase enzyme(s). The plates were weighed before
soiling, after soiling and after washing, and the average % soil
removed from all plates was calculated. The data are shown below in
Table VII.



Table VII

Enzyme               % Soil Removed
                     (Average of All Dishes)

Wild-Type            61
M15S/M197T           66
M15T/M197T           71
W138Y/M197T          68
M15S/W138Y/M197T     62
M15T/W138Y/M197T     72

The data show that the mutant enzymes provided a benefit greater
than that provided by the wild-type. Wild-type amylase provided a
20% greater cleaning benefit in removing oatmeal than did ADD
without amylase.
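The % soil removed figure follows from the three weighings described for each plate (before soiling, after soiling and after washing). A minimal sketch of that arithmetic is given below; the function names and the plate weights in the usage lines are hypothetical, and the source does not state the exact formula used.

```python
def percent_soil_removed(clean_g, soiled_g, washed_g):
    """Percent of the applied soil removed by washing, from three weighings."""
    soil_applied = soiled_g - clean_g
    if soil_applied <= 0:
        raise ValueError("soiled weight must exceed clean weight")
    return 100.0 * (soiled_g - washed_g) / soil_applied

def average_removal(plates):
    """Average percent soil removed over (clean, soiled, washed) triples."""
    values = [percent_soil_removed(*plate) for plate in plates]
    return sum(values) / len(values)

if __name__ == "__main__":
    # Hypothetical plate weights in grams.
    plates = [(300.0, 310.0, 303.0), (300.0, 310.0, 305.0)]
    print(f"average soil removed: {average_removal(plates):.0f}%")
```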

Example 12 
Dish Care Cleaning Composition 
1% (w/w) granules of wild-type and mutant amylases were formulated 
with a Korex Automatic Dishwasher Detergent to which 5% (w/w) sodium 
perborate monohydrate and 3% (w/w) TAED were added. Samples of 
these formulations were placed at room temperature (21-23°C) or at
38°C and 80% relative humidity for four weeks. Results are shown in
Figs. 21 and 22. 

The data show that the wild-type amylase activity, as measured by 
the Ceralpha method, decreased with increasing storage time in 
detergent. At room temperature, the mutant enzymes were completely 
stable. At 38°C and 80% relative humidity, all mutants were more
stable than the wild-type. 

The advantage of formulating an automatic dishwashing detergent with 
these mutant amylases is that these mutants are significantly more 
stable than the wild-type in the presence of perborate and TAED and 
they provide a significant performance benefit in removing starchy 
food stains in the wash. 

Example 13 

Oxidatively Stable Protease/Oxidatively Stable Amylase
Stability Studies 

Enzyme granules containing either: 1) wild-type protease and wild- 
type amylase; or 2) bleach stable protease (GG36-M222S) made by the 
methods described in US Re 34606 and bleach stable amylase (AA20- 
M15T/W138Y/M197T) were dissolved in buffer containing 0.1 M sodium 
borate pH 10.2 and 0.005% Tween 80 at a concentration of 12.5 mg of 
each enzyme. To 9 ml of these solutions was added either 1 ml 
distilled water or 1 ml 30% hydrogen peroxide. After incubation of 
the solutions at 25°C for 30 minutes, the protease and amylase
activity in each solution was measured and is reported as % of the 
original activity. The data are shown below in Table VIII. 

Table VIII

Treatment        Enzyme             % Activity
                                    After 30 Min

Water            WT Amylase         104
Water            WT Protease        94
Water            M222S Protease     119
Water            TYT Amylase        68
3% Peroxide      WT Amylase         14
3% Peroxide      WT Protease        7
3% Peroxide      M222S Protease     116
3% Peroxide      TYT Amylase        75

The data show that the combination of a bleach-stable amylase mutant 
and a bleach-stable protease mutant, both with mutations at amino 
acid residues sensitive to oxidation, provides the combined benefits 
of protease and amylase in a formulation resistant to inactivation 
by bleach. The combination of a bleach-stable amylase and a bleach- 
stable protease retains most of its initial activity after 30 
minutes in bleach, while the combination of wild-type enzymes loses 
over 60% of its initial activity in the same period of time. 
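
The arithmetic behind Example 13 can be checked with a short Python sketch. It is an illustration only: the function names are hypothetical, and "TYT Amylase" is taken to be the table's label for the bleach-stable M15T/W138Y/M197T amylase. The sketch confirms that adding 1 ml of 30% hydrogen peroxide to 9 ml of enzyme solution gives the 3% peroxide named in Table VIII, and converts the reported % activity values into % activity lost.

```python
# Percent of original activity after 30 minutes, from Table VIII.
activity_pct = {
    ("Water", "WT Amylase"): 104,
    ("Water", "WT Protease"): 94,
    ("Water", "M222S Protease"): 119,
    ("Water", "TYT Amylase"): 68,
    ("3% Peroxide", "WT Amylase"): 14,
    ("3% Peroxide", "WT Protease"): 7,
    ("3% Peroxide", "M222S Protease"): 116,
    ("3% Peroxide", "TYT Amylase"): 75,
}


def final_concentration(stock_pct, stock_ml, sample_ml):
    """Concentration after mixing stock_ml of stock into sample_ml of sample."""
    return stock_pct * stock_ml / (stock_ml + sample_ml)


def percent_lost(remaining_pct):
    """Activity lost in percent; readings above 100% are treated as no loss."""
    return max(0, 100 - remaining_pct)


peroxide_pct = final_concentration(30.0, 1.0, 9.0)
losses_in_peroxide = {
    enzyme: percent_lost(pct)
    for (treatment, enzyme), pct in activity_pct.items()
    if treatment == "3% Peroxide"
}

print(f"final peroxide concentration: {peroxide_pct}%")
for enzyme, lost in losses_in_peroxide.items():
    print(f"{enzyme}: {lost}% of original activity lost")
```

On the tabulated values, each wild-type enzyme loses well over 60% of its original activity in peroxide, while the two bleach-stable enzymes lose 25% or less.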

SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT: Barnett, Christopher 
Mitchinson, Colin
Power, Scott D. 

(ii) TITLE OF INVENTION: An Improved Cleaning Composition 

(iii) NUMBER OF SEQUENCES: 66 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genencor International

(B) STREET: 180 Kimball Way

(C) CITY: South San Francisco

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94080 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Horn, Margaret A. 

(B) REGISTRATION NUMBER: 33,401

(C) REFERENCE/DOCKET NUMBER: GC220-3

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 742-7536 

(B) TELEFAX: (415) 742-7217 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
GATCAAAACA TAAAAAACCG GCCTTGGCCC CGCCCGTTTT TTATTATTTT TGAGCT 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
TGGGACGCTG CCGCACTACT TTGAATGGT 
(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
TGATCCAGTA CTTTGAATGG TACCTGCCCA ATGA
(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
GATTATTTGT TCTATGCCGA TATCGACTAT GACCAT 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
CGGGGAAGGA GGCCTTTACG GTAGCT 
(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 
GCGGCTATGA CTTAAGGAAA TTGC 
(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
CTACGGGGAT GCATACGGGA CCA 23
(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

CTACGGGGAT TACTACGGGA CCAAGGGAGA CTCCC 35 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CCGGTGGGGC CAAGCGGGCC TATGTTGGCC GGCAAA 36 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CATCAGCGTC CCATTAAGAT TTGCAGCCTG CGCAGACATG TTGCT 45 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
GATTATTTGG CGTATGCCGA TATCGACTAT GACCAT 36
(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGGAAGTTTC GAATGAAAAC G 
(2) INFORMATION FOR SEQ ID NO:13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GTCGCCATAT GCATATAATC ATAGTTGCCG TTTTCATT 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CGAATGAAAA CGGCAACTAT GATTATTTGA TCTATGCCGA C 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
CGAATGAAAA CGGCAACTAT GATTATTTGT TCTATGCCGA C 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
CGAATGAAAA CGGCAACTAT GATTATTTGC TTTATGCCGA C 
(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 41 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CGAATGAAAA CCGCAACTAT GATTATTTGA OCTATCCCGA C 

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 41 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 
CGAATGAAAA CCGCAACTAT GATTATTTGC CTTATGCCGA C 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CGAATGAAAA CGGCAACTAT GATTATTTGA CATATGCCGA C 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 41 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

CGAATGAAAA CGGCAACTAT GATTATTTGT ACTATGCCGA C 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 41 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
CGAATGAAAA CGCCAACTAT GATTATTTGC ACTATGCCGA C 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CGAATGAAAA CGGCAACTAT GATTATTTGG GCTATGCCGA C 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:
CGAATGAAAA CGGCAACTAT GATTATTTGC AATATGCCGA C 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CGAATGAAAA CGGCAACTAT GATTATTTGA ACTATGCCGA C 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
CGAATGAAAA CGGCAACTAT GATTATTTGA AATATGCCGA C 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:
CGAATGAAAA CGGCAACTAT GATTATTTGG ATTATCCCGA C
(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 41 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
CGAATGAAAA CGGCAACTAT GATTATTTGG AATATGCCGA C 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
CGAATGAAAA CGGCAACTAT GATTATTTGT CTATTGCCGA C 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 : 
CGAATGAAAA CGGCAACTAT GATTATTTGT GGTATGCCGA C 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:
CGAATGAAAA CGGCAACTAT GATTATTTGA GATATGCCGA C 
(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1968 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

AGCTTGAAGA AGTGAAGAAG CAGAGAGGCT ATTGAATAAA TGAGTAGAAA GCGCCATATC 60 

GGCGCTTTTC TTTTGGAAGA AAATATAGGG AAAATGGTAC TTGTTAAAAA TTCGGAATAT 120 

TTATACAACA TCATATGTTT CACATTGAAA GGGGAGGAGA ATCATGAAAC AACAAAAACG 180

GCTTTACGCC CGATTGCTGA CCCTGTTATT TGCGCTCATC TTCTTGCTGC CTCATTCTGC 240 

AGCAGCGGCG GCAAATCTTA ATGGGACGCT GATGCAGTAT TTTGAATGGT ACATGCCCAA 300

TGACGGCCAA CATTCGAAGC GTTTGCAAAA CGACTCCGCA TATTTGGCTG AACACGCTAT 360 

TACTGCCGTC TGGATTCCCC CGGCATATAA GGGAACGAGC CAAGCGGATG TGGGCTACGG 420 

TGCTTACGAC CTTTATGATT TAGGGGAGTT TCATCAAAAA GGGACGGTTC GGACAAAGTA 480

CGGCACAAAA GGAGACCTCC AATCTGCGAT CAAAAGTCTT CATTCCCGCG ACATTAACGT 540 

TTACGGCGAT GTGGTCATCA ACCACAAAGG CCGCGCTGAT GCGACCGAAG ATGTAACCGC 600 

GGTTGAAGTC GATCCCGCTG ACCGCAACCG CGTAATTTCA GGAGAACACC TAATTAAAGC 660 

CTGGACACAT TTTCATTTTC CGGGGCGCGG CAGCACATAC AGCGATTTTA AATGGCATTG 720 

GTACCATTTT GACGGAACCG ATTGGGACGA GTCCCGAAAG CTGAACCGCA TCTATAAGTT 780

TCAAGGAAAG GCTTGGGATT GGGAACTTTC CAATGAAAAC GGCAACTATG ATTATTTGAT 840 

GTATGCCGAC ATCGATTATG ACCATCCTGA TGTCGCAGCA GAAATTAAGA GATGGGGCAC 900 

TTCGTATGCC AATGAACTGC AATTGGACGG TTTCCGTCTT GATCCTGTCA AACACATTAA 960 

ATTTTCTTTT TTGCGGGATT GGGTTAATCA TCTCAGGGAA AAAACGGGGA AGGAAATGTT 1020 

TACGGTAGCT GAATATTGGC AGAATGACTT GGGCGCGCTG GAAAACTATT TGAACAAAAC 1080

AAATTTTAAT CATTCAGTCT TTGACCTGCC CCTTCATTAT CAGTTCCATC CTGCATCGAC 1140

ACAGGGAGGC GGCTATGATA TGAGGAAATT GCTGAACGGT ACGGTCGTTT CCAAGCATCC 1200 

GTTGAAATCG GTTACATTTG TCGATAACCA TGATACACAG CCGGGGCAAT CGCTTGAGTC 1260

GACTGTCCAA ACATGGTTTA AGCCGCTTGC TTACGCTTTT ATTCTCACAA GGGAATCTGG 1320 

ATACCCTCAG GTTTTCTACG GGGATATGTA CGGGACGAAA GGAGACTCCC AGCGCGAAAT 1380

TCCTGCCTTG AAACACAAAA TTGAACCGAT CTTAAAAGCG AGAAAACAGT ATGCGTACGG 1440 

AGCACAGCAT GATTATTTCG ACCACCATGA CATTGTCGGC TGGACAAGGG AAGGCGACAG 1500 

CTCGGTTGCA AATTCAGGTT TGGCGGCATT AATAACAGAC GGACCCGGTG GGGCAAAGCG 1560 

AATGTATGTC GGCCGGCAAA ACGCCGGTGA GACATGGCAT GACATTACCG GAAACCGTTC 1620 

GGAGCCGCTT GTCATCAATT CGGAAGCCTG GGGAGAGTTT CACGTAAACG GCGGGTCGGT 1680 

TTCAATTTAT GTTCAAAGAT AGAAGAGCAG AGAGGACGGA TTTCCTGAAG GAAATCCGTT 1740 

TTTTTATTTT GCCCGTCTTA TAAATTTCTT TGATTACATT TTATAATTAA TTTTAACAAA 1800 

GTCTCATCAG CCCTCAGGAA OGACTTGCTC ACACTTTGAA TCGCATACCT AAGGCGGGGA 1860

TGAAATGCCA ACGTTATCTG ATGTAGCAAA OAAAGCAAAT GTGTCQAAAA TGACOGTATC 1920 

GCGGGTGATC AATCATCCTG AGACTGTGAC GGATGAATTG AAAAAGCT 1968 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Ala Asn Leu Asn Gly Thr Leu Met Gln Tyr Phe Glu Trp Tyr Met Pro
1 5 10 15

Asn Asp Gly Gln His Trp Lys Arg Leu Gln Asn Asp Ser Ala Tyr Leu
20 25 30

Ala Glu His Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Tyr Lys Gly
35 40 45

Thr Ser Gln Ala Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr Asp Leu
50 55 60

Gly Glu Phe His Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys
65 70 75 80

Gly Glu Leu Gln Ser Ala Ile Lys Ser Leu His Ser Arg Asp Ile Asn
85 90 95

Val Tyr Gly Asp Val Val Ile Asn His Lys Gly Gly Ala Asp Ala Thr
100 105 110

Glu Asp Val Thr Ala Val Glu Val Asp Pro Ala Asp Arg Asn Arg Val
115 120 125

Ile Ser Gly Glu His Leu Ile Lys Ala Trp Thr His Phe His Phe Pro
130 135 140

Gly Arg Gly Ser Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe
145 150 155 160

Asp Gly Thr Asp Trp Asp Glu Ser Arg Lys Leu Asn Arg Ile Tyr Lys
165 170 175

Phe Gln Gly Lys Ala Trp Asp Trp Glu Val Ser Asn Glu Asn Gly Asn
180 185 190

Tyr Asp Tyr Leu Met Tyr Ala Asp Ile Asp Tyr Asp His Pro Asp Val
195 200 205

Ala Ala Glu Ile Lys Arg Trp Gly Thr Trp Tyr Ala Asn Glu Leu Gln
210 215 220

Leu Asp Gly Phe Arg Leu Asp Ala Val Lys His Ile Lys Phe Ser Phe
225 230 235 240

Leu Arg Asp Trp Val Asn His Val Arg Glu Lys Thr Gly Lys Glu Met
245 250 255

Phe Thr Val Ala Glu Tyr Trp Gln Asn Asp Leu Gly Ala Leu Glu Asn
260 265 270

Tyr Leu Asn Lys Thr Asn Phe Asn His Ser Val Phe Asp Val Pro Leu
275 280 285

His Tyr Gln Phe His Ala Ala Ser Thr Gln Gly Gly Gly Tyr Asp Met
290 295 300

Arg Lys Leu Leu Asn Gly Thr Val Val Ser Lys His Pro Leu Lys Ser
305 310 315 320

Val Thr Phe Val Asp Asn His Asp Thr Gln Pro Gly Gln Ser Leu Glu
325 330 335

Ser Thr Val Gln Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe Ile Leu
340 345 350

Thr Arg Glu Ser Gly Tyr Pro Gln Val Phe Tyr Gly Asp Met Tyr Gly
355 360 365

Thr Lys Gly Asp Ser Gln Arg Glu Ile Pro Ala Leu Lys His Lys Ile
370 375 380

Glu Pro Ile Leu Lys Ala Arg Lys Gln Tyr Ala Tyr Gly Ala Gln His
385 390 395 400

Asp Tyr Phe Asp His His Asp Ile Val Gly Trp Thr Arg Glu Gly Asp
405 410 415

Ser Ser Val Ala Asn Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro
420 425 430

Gly Gly Ala Lys Arg Met Tyr Val Gly Arg Gln Asn Ala Gly Glu Thr
435 440 445

Trp His Asp Ile Thr Gly Asn Arg Ser Glu Pro Val Val Ile Asn Ser
450 455 460

Glu Gly Trp Gly Glu Phe His Val Asn Gly Gly Ser Val Ser Ile Tyr
465 470 475 480

Val Gln Arg

(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 511 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Met Lys Gln Gln Lys Arg Leu Tyr Ala Arg Leu Leu Thr Leu Leu Phe
1 5 10 15

Ala Leu Ile Phe Leu Leu Pro His Ser Ala Ala Ala Ala Ala Asn Leu
20 25 30

Asn Gly Thr Leu Met Gln Tyr Phe Glu Trp Tyr Met Pro Asn Asp Gly
35 40 45

His Trp Lys Arg Leu Gln Asn Asp Ser Ala Tyr Leu Ala Glu His Gly
50 55 60

Ile Thr Ala Val Trp Ile Pro Pro Ala Tyr Lys Gly Thr Ser Gln Ala
65 70 75 80

Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr Asp Leu Gly Glu Phe His
85 90 95

Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Gly Glu Leu Gln
100 105 110

Ser Ala Ile Lys Ser Leu His Ser Arg Asp Ile Asn Val Tyr Gly Asp
115 120 125

Val Val Ile Asn His Lys Gly Gly Ala Asp Ala Thr Glu Asp Val Thr
130 135 140

Ala Val Glu Val Asp Pro Ala Asp Arg Asn Arg Val Ile Ser Gly Glu
145 150 155 160

His Leu Ile Lys Ala Trp Thr His Phe His Phe Pro Gly Arg Gly Ser
165 170 175

Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe Asp Gly Thr Asp
180 185 190

Trp Asp Glu Ser Arg Lys Leu Asn Arg Ile Tyr Lys Phe Gln Gly Lys
195 200 205

Ala Trp Asp Trp Glu Val Ser Asn Glu Asn Gly Asn Tyr Asp Tyr Leu
210 215 220

Met Tyr Ala Asp Ile Asp Tyr Asp His Pro Asp Val Ala Ala Glu Ile
225 230 235 240

Lys Arg Trp Gly Thr Trp Tyr Ala Asn Glu Leu Gln Leu Asp Gly Phe
245 250 255

Arg Leu Asp Ala Val Lys His Ile Lys Phe Ser Phe Leu Arg Asp Trp
260 265 270

Val Asn His Val Arg Glu Lys Thr Gly Lys Glu Met Phe Thr Val Ala
275 280 285

Glu Tyr Trp Gln Asn Asp Leu Gly Ala Leu Glu Asn Tyr Leu Asn Lys
290 295 300

Thr Asn Phe Asn His Ser Val Phe Asp Val Pro Leu His Tyr Gln Phe
305 310 315 320

His Ala Ala Ser Thr Gln Gly Gly Gly Tyr Asp Met Arg Lys Leu Leu
325 330 335

Asn Gly Thr Val Val Ser Lys His Pro Leu Lys Ser Val Thr Phe Val
340 345 350

Asp Asn His Asp Thr Gln Pro Gly Gln Ser Leu Glu Ser Thr Val Gln
355 360 365

Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe Ile Leu Thr Arg Glu Ser
370 375 380

Gly Tyr Pro Gln Val Phe Tyr Gly Asp Met Tyr Gly Thr Lys Gly Asp
385 390 395 400

Ser Gln Arg Glu Ile Pro Ala Leu Lys His Lys Ile Glu Pro Ile Leu
405 410 415

Lys Ala Arg Lys Gln Tyr Ala Tyr Gly Ala Gln His Asp Tyr Phe Asp
420 425 430

His His Asp Ile Val Gly Trp Thr Arg Glu Gly Asp Ser Ser Val Ala
435 440 445

Asn Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro Gly Gly Ala Lys
450 455 460

Arg Met Tyr Val Gly Arg Gln Asn Ala Gly Glu Thr Trp His Asp Ile
465 470 475 480

Thr Gly Asn Arg Ser Glu Pro Val Val Ile Asn Ser Glu Gly Trp Gly
485 490 495

Glu Phe His Val Asn Gly Gly Ser Val Ser Ile Tyr Val Gln Arg
500 505 510

(2) INFORMATION FOR SEQ ID NO: 34:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 520 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Arg Gly Arg Gly Asn Met Ile Gln Lys Arg Lys Arg Thr Val Ser
1 5 10 15

Phe Arg Leu Val Leu Met Cys Thr Leu Leu Phe Val Ser Leu Pro Ile
20 25 30

Thr Lys Thr Ser Ala Val Asn Gly Thr Leu Met Gln Tyr Phe Glu Trp
35 40 45

Tyr Thr Pro Asn Asp Gly Gln His Trp Lys Arg Leu Gln Asn Asp Ala
50 55 60

Glu His Leu Ser Asp Ile Gly Ile Thr Ala Val Trp Ile Pro Pro Ala
65 70 75 80

Tyr Lys Gly Leu Ser Gln Ser Asp Asn Gly Tyr Gly Pro Tyr Asp Leu
85 90 95

Tyr Asp Leu Gly Glu Phe Gln Gln Lys Gly Thr Val Arg Thr Lys Tyr
100 105 110

Gly Thr Lys Ser Glu Leu Gln Asp Ala Ile Gly Ser Leu His Ser Arg
115 120 125

Asn Val Gln Val Tyr Gly Asp Val Val Leu Asn His Lys Ala Gly Ala
130 135 140

Asp Ala Thr Glu Asp Val Thr Ala Val Glu Val Asn Pro Ala Asn Arg
145 150 155 160

Asn Gln Glu Thr Ser Glu Glu Tyr Gln Ile Lys Ala Trp Thr Asp Phe
165 170 175

Arg Phe Pro Gly Arg Gly Asn Thr Tyr Ser Asp Phe Lys Trp His Trp
180 185 190

Tyr His Phe Asp Gly Ala Asp Trp Asp Glu Ser Arg Lys Ile Ser Arg
195 200 205

Ile Phe Lys Phe Arg Gly Glu Gly Lys Ala Trp Asp Trp Glu Val Ser
210 215 220

Ser Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Tyr
225 230 235 240

Asp His Pro Asp Val Val Ala Glu Thr Lys Lys Trp Gly Ile Trp Tyr
245 250 255

Ala Asn Glu Leu Ser Leu Asp Gly Phe Arg Ile Asp Ala Ala Lys His
260 265 270

Ile Lys Phe Ser Phe Leu Arg Asp Trp Val Gln Ala Val Arg Gln Ala
275 280 285

Thr Gly Lys Glu Met Phe Thr Val Ala Glu Tyr Trp Gln Asn Asn Ala
290 295 300

Gly Lys Leu Glu Asn Tyr Leu Asn Lys Thr Ser Phe Asn Gln Ser Val
305 310 315 320

Phe Asp Val Pro Leu His Phe Asn Leu Gln Ala Ala Ser Ser Gln Gly
325 330 335

Gly Gly Tyr Asp Met Arg Arg Leu Leu Asp Gly Thr Val Val Ser Arg
340 345 350

His Pro Glu Lys Ala Val Thr Phe Val Glu Asn His Asp Thr Gln Pro
355 360 365

Gly Gln Ser Leu Glu Ser Thr Val Gln Thr Trp Phe Lys Pro Leu Ala
370 375 380

Tyr Ala Phe Ile Leu Thr Arg Glu Ser Gly Tyr Pro Gln Val Phe Tyr
385 390 395 400

Gly Asp Met Tyr Gly Thr Lys Gly Thr Ser Pro Lys Glu Ile Pro Ser
405 410 415

Leu Lys Asp Asn Ile Glu Pro Ile Leu Lys Ala Arg Lys Glu Tyr Ala
420 425 430

Tyr Gly Pro Gln His Asp Tyr Ile Asp His Pro Asp Val Ile Gly Trp
435 440 445

Thr Arg Glu Gly Asp Ser Ser Ala Ala Lys Ser Gly Leu Ala Ala Leu
450 455 460

Ile Thr Asp Gly Pro Gly Gly Ser Lys Arg Met Tyr Ala Gly Leu Lys
465 470 475 480

Asn Ala Gly Glu Thr Trp Tyr Asp Ile Thr Gly Asn Arg Ser Asp Thr
485 490 495

Val Lys Ile Gly Ser Asp Gly Trp Gly Glu Phe His Val Asn Asp Gly
500 505 510

Ser Val Ser Ile Tyr Val Gln Lys
515 520

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 548 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

Val Leu Thr Phe His Arg Ile Ile Arg Lys Gly Trp Met Phe Leu Leu
1 5 10 15

Ala Phe Leu Leu Thr Ala Ser Leu Phe Cys Pro Thr Gly Arg His Ala
20 25 30

Lys Ala Ala Ala Pro Phe Asn Gly Thr Met Met Gln Tyr Phe Glu Trp
35 40 45

Tyr Leu Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala
50 55 60

Asn Asn Leu Ser Ser Leu Gly Ile Thr Ala Leu Ser Leu Pro Pro Ala
65 70 75 80

Tyr Lys Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu
85 90 95

Tyr Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr
100 105 110

Gly Thr Lys Ala Gln Tyr Leu Gln Ala Ile Gln Ala Ala His Ala Ala
115 120 125

Gly Met Gln Val Tyr Ala Asp Val Val Phe Asp His Lys Gly Gly Ala
130 135 140

Asp Gly Thr Glu Trp Val Asp Ala Val Glu Val Asn Pro Ser Asp Arg
145 150 155 160

Asn Gln Glu Ile Ser Gly Thr Tyr Gln Ile Gln Ala Trp Thr Lys Phe
165 170 175

Asp Phe Pro Gly Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp
180 185 190

Tyr His Phe Asp Gly Val Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg
195 200 205

Ile Tyr Lys Phe Arg Gly Ile Gly Lys Ala Trp Asp Trp Glu Val Asp
210 215 220

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met
225 230 235 240

Asp His Pro Glu Val Val Thr Glu Leu Lys Asn Trp Gly Lys Trp Tyr
245 250 255

Val Asn Thr Thr Asn Ile Asp Gly Phe Arg Leu Asp Gly Leu Lys His
260 265 270

Ile Lys Phe Ser Phe Phe Pro Asp Trp Leu Ser Tyr Val Arg Ser Gln
275 280 285

Thr Gly Lys Pro Leu Phe Thr Val Gly Glu Tyr Trp Ser Tyr Asp Ile
290 295 300

Asn Lys Leu His Asn Tyr Ile Thr Lys Thr Asn Gly Thr Met Ser Leu
305 310 315 320

Phe Asp Ala Pro Leu His Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly
325 330 335

Gly Ala Phe Asp Met Arg Thr Leu Met Thr Asn Thr Leu Met Lys Asp
340 345 350

Gln Pro Thr Leu Ala Val Thr Phe Val Asp Asn His Asp Thr Asn Pro
355 360 365

Ala Lys Arg Cys Ser His Gly Arg Pro Trp Phe Lys Pro Leu Ala Tyr
370 375 380

Ala Phe Ile Leu Thr Arg Gln Glu Gly Tyr Pro Cys Val Phe Tyr Gly
385 390 395 400

Asp Tyr Tyr Gly Ile Pro Gln Tyr Asn Ile Pro Ser Leu Lys Ser Lys
405 410 415

Ile Asp Pro Leu Leu Ile Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gln
420 425 430

His Asp Tyr Leu Asp His Ser Asp Ile Ile Gly Trp Thr Arg Glu Gly
435 440 445

Val Thr Glu Lys Pro Gly Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly
450 455 460

Ala Gly Arg Ser Lys Trp Met Tyr Val Gly Lys Gln His Ala Gly Lys
465 470 475 480

Val Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr Ile Asn
485 490 495

Ser Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser Val
500 505 510

Trp Val Pro Arg Lys Thr Thr Val Ser Thr Ile Ala Arg Pro Ile Thr
515 520 525

Thr Arg Pro Trp Thr Gly Glu Phe Val Arg Trp His Glu Pro Arg Leu
530 535 540

Val Ala Trp Pro
545

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 483 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Ala Asn Leu Asn Gly Thr Leu Met Gln Tyr Phe Glu Trp Tyr Met Pro
1 5 10 15

Asn Asp Gly Gln His Trp Lys Arg Leu Gln Asn Asp Ser Ala Tyr Leu
20 25 30

Ala Glu His Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Tyr Lys Gly
35 40 45

Thr Ser Gln Ala Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr Asp Leu
50 55 60

Gly Glu Phe His Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys
65 70 75 80

Gly Glu Leu Gln Ser Ala Ile Lys Ser Leu His Ser Arg Asp Ile Asn
85 90 95

Val Tyr Gly Asp Val Val Ile Asn His Lys Gly Gly Ala Asp Ala Thr
100 105 110

Glu Asp Val Thr Ala Val Glu Val Asp Pro Ala Asp Arg Asn Arg Val
115 120 125

Ile Ser Gly Glu His Leu Ile Lys Ala Trp Thr His Phe His Phe Pro
130 135 140

Gly Arg Gly Ser Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe
145 150 155 160

Asp Gly Thr Asp Trp Asp Glu Ser Arg Lys Leu Asn Arg Ile Tyr Lys
165 170 175

Phe Gln Gly Lys Ala Trp Asp Trp Glu Val Ser Asn Glu Asn Gly Asn
180 185 190

Tyr Asp Tyr Leu Thr Tyr Ala Asp Ile Asp Tyr Asp His Pro Asp Val
195 200 205

Ala Ala Glu Ile Lys Arg Trp Gly Thr Trp Tyr Ala Asn Glu Leu Gln
210 215 220

Leu Asp Gly Phe Arg Leu Asp Ala Val Lys His Ile Lys Phe Ser Phe
225 230 235 240

Leu Arg Asp Trp Val Asn His Val Arg Glu Lys Thr Gly Lys Glu Met
245 250 255

Phe Thr Val Ala Glu Tyr Trp Gln Asn Asp Leu Gly Ala Leu Glu Asn
260 265 270

Tyr Leu Asn Lys Thr Asn Phe Asn His Ser Val Phe Asp Val Pro Leu
275 280 285

His Tyr Gln Phe His Ala Ala Ser Thr Gln Gly Gly Gly Tyr Asp Met
290 295 300

Arg Lys Leu Leu Asn Gly Thr Val Val Ser Lys His Pro Leu Lys Ser
305 310 315 320

Val Thr Phe Val Asp Asn His Asp Thr Gln Pro Gly Gln Ser Leu Glu
325 330 335

Ser Thr Val Gln Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe Ile Leu
340 345 350

Thr Arg Glu Ser Gly Tyr Pro Gln Val Phe Tyr Gly Asp Met Tyr Gly
355 360 365

Thr Lys Gly Asp Ser Gln Arg Glu Ile Pro Ala Leu Lys His Lys Ile
370 375 380

Glu Pro Ile Leu Lys Ala Arg Lys Gln Tyr Ala Tyr Gly Ala Gln His
385 390 395 400

Asp Tyr Phe Asp His His Asp Ile Val Gly Trp Thr Arg Glu Gly Asp
405 410 415

Ser Ser Val Ala Asn Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro
420 425 430

Gly Gly Ala Lys Arg Met Tyr Val Gly Arg Gln Asn Ala Gly Glu Thr
435 440 445

Trp His Asp Ile Thr Gly Asn Arg Ser Glu Pro Val Val Ile Asn Ser
450 455 460

Glu Gly Trp Gly Glu Phe His Val Asn Gly Gly Ser Val Ser Ile Tyr
465 470 475 480

Val Gln Arg

(2) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 487 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

Ala Ala Ala Ala Ala Asn Leu Asn Gly Thr Leu Met Gln Tyr Phe Glu
1 5 10 15

Trp Tyr Met Pro Asn Asp Gly Gln His Trp Lys Arg Leu Gln Asn Asp
20 25 30

Ser Ala Tyr Leu Ala Glu His Gly Ile Thr Ala Val Trp Ile Pro Pro
35 40 45

Ala Tyr Lys Gly Thr Ser Gln Ala Asp Val Gly Tyr Gly Ala Tyr Asp
50 55 60

Leu Tyr Asp Leu Gly Glu Phe His Gln Lys Gly Thr Val Arg Thr Lys
65 70 75 80

Tyr Gly Thr Lys Gly Glu Leu Gln Ser Ala Ile Lys Ser Leu His Ser
85 90 95

Arg Asp Ile Asn Val Tyr Gly Asp Val Val Ile Asn His Lys Gly Gly
100 105 110

Ala Asp Ala Thr Glu Asp Val Thr Ala Val Glu Val Asp Pro Ala Asp
115 120 125

Arg Asn Arg Val Ile Ser Gly Glu His Leu Ile Lys Ala Trp Thr His
130 135 140

Phe His Phe Pro Gly Arg Gly Ser Thr Tyr Ser Asp Phe Lys Trp His
145 150 155 160

Trp Tyr His Phe Asp Gly Thr Asp Trp Asp Glu Ser Arg Lys Leu Asn
165 170 175

Arg Ile Tyr Lys Phe Gln Gly Lys Ala Trp Asp Trp Glu Val Ser Asn
180 185 190

Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Ile Asp Tyr Asp
195 200 205

His Pro Asp Val Ala Ala Glu Ile Lys Arg Trp Gly Thr Trp Tyr Ala
210 215 220

Asn Glu Leu Gln Leu Asp Gly Phe Arg Leu Asp Ala Val Lys His Ile
225 230 235 240

Lys Phe Ser Phe Leu Arg Asp Trp Val Asn His Val Arg Glu Lys Thr
245 250 255

Gly Lys Glu Met Phe Thr Val Ala Glu Tyr Trp Gln Asn Asp Leu Gly
260 265 270

Ala Leu Glu Asn Tyr Leu Asn Lys Thr Asn Phe Asn His Ser Val Phe
275 280 285

Asp Val Pro Leu His Tyr Gln Phe His Ala Ala Ser Thr Gln Gly Gly
290 295 300

Gly Tyr Asp Met Arg Lys Leu Leu Asn Gly Thr Val Val Ser Lys His
305 310 315 320

Pro Leu Lys Ser Val Thr Phe Val Asp Asn His Asp Thr Gln Pro Gly
325 330 335

Gln Ser Leu Glu Ser Thr Val Gln Thr Trp Phe Lys Pro Leu Ala Tyr
340 345 350

Ala Phe Ile Leu Thr Arg Glu Ser Gly Tyr Pro Gln Val Phe Tyr Gly
355 360 365

Asp Met Tyr Gly Thr Lys Gly Asp Ser Gln Arg Glu Ile Pro Ala Leu
370 375 380

Lys His Lys Ile Glu Pro Ile Leu Lys Ala Arg Lys Gln Tyr Ala Tyr
385 390 395 400

Gly Ala Gln His Asp Tyr Phe Asp His His Asp Ile Val Gly Trp Thr
405 410 415

Arg Glu Gly Asp Ser Ser Val Ala Asn Ser Gly Leu Ala Ala Leu Ile
420 425 430

Thr Asp Gly Pro Gly Gly Ala Lys Arg Met Tyr Val Gly Arg Gln Asn
435 440 445

Ala Gly Glu Thr Trp His Asp Ile Thr Gly Asn Arg Ser Glu Pro Val
450 455 460

Val Ile Asn Ser Glu Gly Trp Gly Glu Phe His Val Asn Gly Gly Ser
465 470 475 480

Val Ser Ile Tyr Val Gln Arg
485
(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

Met Lys Gln Gln Lys Arg Leu Thr Ala Arg Leu Leu Thr Leu Leu Phe
1 5 10 15

Ala Leu Ile Phe Leu Leu Pro His Ser Ala Ala Ala Ala Ala Asn Leu
20 25 30



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Met Arg Ser Lys Thr Leu Trp Ile Ser Leu Leu Phe Ala Leu Thr Leu
1 5 10 15

Ile Phe Thr Met Ala Phe Ser Asn Met Ser Ala Gln Ala Ala Gly Lys
20 25 30

Ser 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Met Arg Ser Lys Thr Leu Trp Ile Ser Leu Leu Phe Ala Leu Thr Leu
1 5 10 15

Ile Phe Thr Met Ala Phe Ser Asn Met Ser Ala Gln Ala Ala Ala Ala
20 25 30

Ala Ala Asn 
35 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:

Met Arg Ser Lys Thr Leu Trp Ile Ser Leu Leu Phe Ala Leu Thr Leu
1 5 10 15

Ile Phe Thr Met Ala Phe Ser Asn Met Ser Ala Gln Ala Ala Asn Leu
20 25 30

(2) INFORMATION FOR SEQ ID NO: 42:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:
CACCTAATTA AAGCTTTCAC ACATTTTCAT TTT 33 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:
CACCTAATTA AAGCTTACAC ACATTTTCAT TTT 33 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
CCGCGTAATT TCCGGAGAAC ACCTAATTAA AGCCGCAACA CATTTTCATT TTCCCGGGCG 60 
CGGCAG 66 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:
CCGGAGAACA CCTAATTAAA GCCCTAACAC ATTTTCATTT TC
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
CCGGAGAACA CCTAATTAAA GCCCACACAC ATTTTCATTT TC 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

CCGGAGAACA CCTAATTAAA GCCTGCACAC ATTTTCATTT TC 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
GATGCAGTAT TTCGAACTGG TATA 
(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
TGCCCAATGA TGGCCAACAT TGGAAG 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50:

CGAATGGTAT GCTCCCAATG ACGG

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51:

CGAATGGTAT CGCCCCAATG ACGG 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:
CGAATGGTAT AATCCCAATG ACGG 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53:
CGAATGGTAT GATCCCAATG ACGG 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
CGAATGGTAT CACCCCAATG ACGG
(2) INFORMATION FOR SEQ ID NO: 55:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55:
CGAATGGTAT AAACCCAATG ACGG
(2) INFORMATION FOR SEQ ID NO: 56:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 
CGAATGGTAT CCGCCCAATG ACGG 24 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 
CGAATGGTAT TCTCCCAATG ACGG 24 
(2) INFORMATION FOR SEQ ID NO: 58:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:
CGAATGGTAC ACTCCCAATG ACGG 24 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:
CGAATGGTAT GTTCCCAATG ACGG 24
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60:
CGAATGGTAT TGTCCCAATG ACGG 24
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61:
CGAATGGTAT CAACCCAATG ACGG 24 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
CGAATGGTAT CAACCCAATG ACGG 24 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
CGAATGGTAT GGTCCCAATG ACGG 24 
(2) INFORMATION FOR SEQ ID NO: 64:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64:
CGAATGGTAT ATTCCCAATG ACGG
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
CGAATGGTAT TTTCCCAATG ACGG 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 
CGAATGGTAC TGGCCCAATG ACGG 
(2) INFORMATION FOR SEQ ID NO: 67:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
CGAATGGTAT TATCCCAATG ACGG 
(2) INFORMATION FOR SEQ ID NO: 68:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
CCGTCATTGG GACTACGTAC CATT 
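
SEQ ID NO: 68 is easier to compare with the 24-mers of SEQ ID NOS: 50-67 once it is read in the opposite orientation; its reverse complement appears to share their flanking stretches. The Python sketch below is a reading aid only and is not part of the listing. The function name `reverse_complement` is hypothetical.

```python
# Illustrative helper, not part of the sequence listing.
_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (spaces are ignored)."""
    bases = seq.replace(" ", "").upper()
    return "".join(_COMPLEMENT[base] for base in reversed(bases))


# SEQ ID NO: 68 exactly as printed in the listing above.
seq_id_68 = "CCGTCATTGG GACTACGTAC CATT"
print(reverse_complement(seq_id_68))
```

A `KeyError` from this helper signals a character that is not A, C, G or T, which can be a convenient way to spot scanning errors in a listed oligonucleotide.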
WHAT IS CLAIMED IS:

1. An improved bleach-containing cleaning composition, the
improvement comprising adding to the bleach-containing composition a
mutant alpha-amylase that is the expression product of a mutated DNA
sequence encoding an alpha-amylase, the mutated DNA sequence being
derived from a precursor alpha-amylase by the substitution of a
methionine at a position equivalent to M+197 in B. licheniformis
alpha-amylase and the substitution of one or more methionine or
tryptophan at a position equivalent to M+15 or W+138 in B.
licheniformis alpha-amylase.

2. An improved cleaning composition of Claim 1 wherein the 
cleaning composition is a dish care cleaning composition. 

3. An improved cleaning composition of Claim 1 wherein the mutant 
alpha-amylase is selected from the group consisting of M15T/M197T; 
M15S/M197T; W138Y/M197T; M15S/W138Y/M197T and M15T/W138Y/M197T. 

4. An improved cleaning composition of Claim 1 further comprising 
a mutant protease that is the expression product of a mutated DNA 
sequence encoding a protease, the mutated DNA sequence being derived 
from a precursor protease by the substitution of a methionine at a 
position equivalent to M+222 in Bacillus amyloliquefaciens protease. 

5. An improved cleaning composition of Claim 4 wherein the mutant 
protease comprises a substitution selected from the group of amino 
acids consisting of alanine, cysteine and serine. 

6. An improved cleaning composition of Claim 4 comprising an 
alpha-amylase mutant selected from the group consisting of 
M15T/M197T, M15S/M197T, W138Y/M197T, M15S/W138Y/M197T and 
M15T/W138Y/M197T, and a protease mutant selected from the group
consisting of M222C, M222S and M222A. 

7. An improved cleaning composition of Claim 6 which is a
granular composition. 
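
The variant names used in Claims 3 and 6 (for example M15S/W138Y/M197T) give the original residue, its position and the replacement residue, with multiple substitutions joined by "/". The Python sketch below shows one way to read that notation and apply it to a sequence. It is an illustration only: the function names are hypothetical, and the sample is the first 20 residues of the mature B. licheniformis alpha-amylase as printed in the figures, in which position 15 is methionine.

```python
import re

# One substitution: original residue, optional "+", position, new residue.
_SUBSTITUTION = re.compile(r"^([A-Z])\+?(\d+)([A-Z])$")


def parse_variant(name: str) -> list[tuple[str, int, str]]:
    """Split a name such as 'M15T/M197T' into (original, position, new) tuples."""
    substitutions = []
    for token in name.split("/"):
        match = _SUBSTITUTION.match(token.strip())
        if match is None:
            raise ValueError(f"unrecognised substitution: {token!r}")
        substitutions.append((match.group(1), int(match.group(2)), match.group(3)))
    return substitutions


def apply_variant(mature_seq: str, name: str) -> str:
    """Apply the substitutions to a one-letter sequence (positions are 1-based)."""
    residues = list(mature_seq)
    for original, position, new in parse_variant(name):
        if position > len(residues):
            raise ValueError(f"position {position} is beyond the sequence end")
        if residues[position - 1] != original:
            raise ValueError(f"expected {original} at position {position}")
        residues[position - 1] = new
    return "".join(residues)


# Hypothetical usage on a 20-residue N-terminal fragment only.
fragment = "ANLNGTLMQYFEWYMPNDGQ"
print(apply_variant(fragment, "M15T"))
```

Checking the original residue before substituting guards against numbering mismatches, for example when a sequence still carries its signal peptide.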
10 30 50
AGCTTGAAGAAGTGAAGAAGCAGAGAGGCTATTGAATAAATGAGTAGAAAGCGCCATATC

70 90 110
GGCGCTTTTCTTTTGGAAGAAAATATAGGGAAAATGGTACTTGTTAAAAATTCGGAATAT

130 150 170
TTATACAACATCATATGTTTCACATTGAAAGGGGAGGAGAATCATGAAACAACAAAAACG
MKQQKR

190 210 230
GCTTTACGCCCGATTGCTGACGCTGTTATTTGCGCTCATCTTCTTGCTGCCTCATTCTGC
LYARLLTLLFALIFLLPHSA

250 270 290
AGCAGCGGCGGCAAATCTTAATGGGACGCTGATGCAGTATTTTGAATGGTACATGCCCAA
AAAANLNGTLMQYFEWYMPN

310 330 350
TGACGGCCAACATTGGAAGCGTTTGCAAAACGACTCGGCATATTTGGCTGAACACGGTAT
DGQHWKRLQNDSAYLAEHGI

370 390 410
TACTGCCGTCTGGATTCCCCCGGCATATAAGGGAACGAGCCAAGCGGATGTGGGCTACGG
TAVWIPPAYKGTSQADVGYG

430 450 470
TGCTTACGACCTTTATGATTTAGGGGAGTTTCATCAAAAAGGGACGGTTCGGACAAAGTA
AYDLYDLGEFHQKGTVRTKY

490 510 530
CGGCACAAAAGGAGAGCTGCAATCTGCGATCAAAAGTCTTCATTCCCGCGACATTAACGT
GTKGELQSAIKSLHSRDINV

550 570 590
TTACGGGGATGTGGTCATCAACCACAAAGGCGGCGCTGATGCGACCGAAGATGTAACCGC
YGDVVINHKGGADATEDVTA

610 630 650
GGTTGAAGTCGATCCCGCTGACCGCAACCGCGTAATTTCAGGAGAACACCTAATTAAAGC
VEVDPADRNRVISGEHLIKA

670 690 710
CTGGACACATTTTCATTTTCCGGGGCGCGGCAGCACATACAGCGATTTTAAATGGCATTG
WTHFHFPGRGSTYSDFKWHW

730 750 770
GTACCATTTTGACGGAACCGATTGGGACGAGTCCCGAAAGCTGAACCGCATCTATAAGTT
YHFDGTDWDESRKLNRIYKF

790 810 830
TCAAGGAAAGGCTTGGGATTGGGAAGTTTCCAATGAAAACGGCAACTATGATTATTTGAT
QGKAWDWEVSNENGNYDYLM
850 870 890
GTATGCCGACATCGATTATGACCATCCTGATGTCGCAGCAGAAATTAAGAGATGGGGCAC
YADIDYDHPDVAAEIKRWGT

910 930 950
TTGGTATGCCAATGAACTGCAATTGGACGGTTTCCGTCTTGATGCTGTCAAACACATTAA
WYANELQLDGFRLDAVKHIK

970 990 1010
ATTTTCTTTTTTGCGGGATTGGGTTAATCATGTCAGGGAAAAAACGGGGAAGGAAATGTT
FSFLRDWVNHVREKTGKEMF

1030 1050 1070
TACGGTAGCTGAATATTGGCAGAATGACTTGGGCGCGCTGGAAAACTATTTGAACAAAAC
TVAEYWQNDLGALENYLNKT

1090 1110 1130
AAATTTTAATCATTCAGTGTTTGACGTGCCGCTTCATTATCAGTTCCATGCTGCATCGAC
NFNHSVFDVPLHYQFHAAST

1150 1170 1190
ACAGGGAGGCGGCTATGATATGAGGAAATTGCTGAACGGTACGGTCGTTTCCAAGCATCC
QGGGYDMRKLLNGTVVSKHP

1210 1230 1250
GTTGAAATCGGTTACATTTGTCGATAACCATGATACACAGCCGGGGCAATCGCTTGAGTC
LKSVTFVDNHDTQPGQSLES

1270 1290 1310
GACTGTCCAAACATGGTTTAAGCCGCTTGCTTACGCTTTTATTCTCACAAGGGAATCTGG
TVQTWFKPLAYAFILTRESG

1330 1350 1370
ATACCCTCAGGTTTTCTACGGGGATATGTACGGGACGAAAGGAGACTCCCAGCGCGAAAT
YPQVFYGDMYGTKGDSQREI

1390 1410 1430
TCCTGCCTTGAAACACAAAATTGAACCGATCTTAAAAGCGAGAAAACAGTATGCGTACGG
PALKHKIEPILKARKQYAYG

1450 1470 1490
AGCACAGCATGATTATTTCGACCACCATGACATTGTCGGCTGGACAAGGGAAGGCGACAG
AQHDYFDHHDIVGWTREGDS

1510 1530 1550
CTCGGTTGCAAATTCAGGTTTGGCGGCATTAATAACAGACGGACCCGGTGGGGCAAAGCG
SVANSGLAALITDGPGGAKR

1570 1590 1610
AATGTATGTCGGCCGGCAAAACGCCGGTGAGACATGGCATGACATTACCGGAAACCGTTC
MYVGRQNAGETWHDITGNRS

1630 1650 1670
GGAGCCGGTTGTCATCAATTCGGAAGGCTGGGGAGAGTTTCACGTAAACGGCGGGTCGGT
EPVVINSEGWGEFHVNGGSV
1690 1710 1730
TTCAATTTATGTTCAAAGATAGAAGAGCAGAGAGGACGGATTTCCTGAAGGAAATCCGTT
SIYVQR*

1750 1770 1790
TTTTTATTTTGCCCGTCTTATAAATTTCTTTGATTACATTTTATAATTAATTTTAACAAA

1810 1830 1850
GTGTCATCAGCCCTCAGGAAGGACTTGCTGACAGTTTGAATCGCATAGGTAAGGCGGGGA

1870 1890 1910
TGAAATGGCAACGTTATCTGATGTAGCAAAGAAAGCAAATGTGTCGAAAATGACGGTATC

1930 1950
GCGGGTGATCAATCATCCTGAGACTGTGACGGATGAATTGAAAAAGCT
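
The one-letter translation printed beneath the coding region of this figure can be checked codon by codon against the standard genetic code. The Python sketch below is an illustrative aid and not part of the figure; the `translate` function and its codon table layout are assumptions of this example. The sample input is the first six codons of the open reading frame, which begins with the ATG ahead of position 170.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
_CODON_TABLE = {
    first + second + third: _AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(_BASES)
    for j, second in enumerate(_BASES)
    for k, third in enumerate(_BASES)
}


def translate(dna: str) -> str:
    """Translate a coding DNA string in frame 1; '*' marks a stop codon."""
    bases = dna.replace(" ", "").upper()
    usable = len(bases) - len(bases) % 3  # ignore a trailing partial codon
    return "".join(_CODON_TABLE[bases[i:i + 3]] for i in range(0, usable, 3))


# First six codons of the open reading frame as printed in the figure.
print(translate("ATGAAACAACAAAAACGG"))
```

A codon containing a character other than A, C, G or T raises a `KeyError`, which helps locate scanning errors in the printed nucleotide rows.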



FIG. 1C

SUBSTITUTE SHEET (RULE 26)

WO 96/05295 PCT/US95/10426

10 30 50
ANLNGTLMQYFEWYMPNDGQHWKRLQNDSAYLAEHGITAVWIPPAYKGTSQADVGYGAYD

70 90 110

B. licheniformis alpha-amylase (PstI)
N-terminus

B. subtilis alkaline protease aprE (PstI)
MRSKTLWISLLFALTLIFTMAFSNMSAQAAGKS
N-terminus

B. licheniformis alpha-amylase in pA4BL (PstI)
MRSKTLWISLLFALTLIFTMAFSNMSAQAAAAAAN
N-terminus

B. licheniformis alpha-amylase in pBLapr
MRSKTLWISLLFALTLIFTMAFSNMSAQAANL
N-terminus

(PstI) indicates the location of the restriction site in the gene.
N-terminus indicates the cleavage site between the signal peptide and the secreted protein.